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Abstract 


This research addresses the problem of proving lower bounds on the complexity of alge- 
braic computations involving the floor operation. The model of computation considered 
is a computation tree with the set of basic operations {+,—,*,/,|-|, >}. The constants 
available to the computation are 0 and 1, and every other constant needs to be generated 
explicitly. The problems that are considered may be broken down into the following 
categories: 


1; 


bo 


Functions of a single input. A general lower bound technique is developed for a class 
of functions that have as their input a single n-bit integer. A characterization of the 
functions to which this technique applies is given. The characterization is general, 
and applies to many natural functions, such as perfect square root (deciding if the 
square root of the input is integral or not) and computing the value of |log log z]. 
Every function that with this characterization requires at least Q(./logn) opera- 
tions. 


Upper bounds are given which use the floor function in non-trivial ways. The most 
surprising result is that the computation of 2?" can be accelerated in the presence 
of the floor function. Using only rational operations (i.e. {+,—,*, /}), it requires 
O(k) operations to compute (by repeated squaring), and this bound is tight (for 
ratioual operations). In contrast, we show that with the use of the floor operation 
and an additional input greater than 22, the number 22" can be generated in 
©(Vk) operations. Using the upper bounds developed, an O(./logn) upper bound 
for computing {log log z| is given (where z is an n-bit integer). This upper bound 
matches the lower bound proved for this function. 


. Functions of two inputs. A very natural example of a function that has two integer 


inputs is the Greatest Common Divisor function. Grotschel, Lovdsz and Schrijver 
posed as an open problem the problem of showing that computing the greatest 
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Chapter 1 


Introduction 


It is very natural to define the complexity of an algorithm with respect to the number of 
operations that it performs. Computation trees give a way to formalize such a model. A 
computation tree, intuitively, has a set of basic operations, each requires unit time, and 
the complexity of the algorithm is measured with respect to this set of operations. (See 


Chapter 2 for both motivation and definition of the computation tree model.) 


Arithmetic operations, e.g., addition, multiplication, etc, are natural candidates for 
the set of allowable unit time operations. Deriving lower bounds for computations that 
involve arithmetic operations has received much attention. As one would expect, at first 
lower bounds were developed for limited sets of arithmetic operations, and with time, 
the results were extended to richer sets of operations. Probably the simplest model is 
one that has only comparisons, which is used to show an Q(nlogn) lower bound for 
sorting (see [Knu81]). A considerably richer model is one that allows linear operations 
in addition to comparisons. Lower bounds for computations with linear operations can 
be found in [Rei71, Dob76, DL78, DL80, Yao75, YR80]. As for rational operations, i.e. 
{+,—,*,/}, the first lower bound is [Rab72], which is an information theoretic lower 


bound. 
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inputs. If the size of numbers is not restricted then “too many” functions are computed 
with a polynomial number of operations. For example, any PSPACE-computation can 
be simulated in polynomial time [BMS81, PS76, Sim81]. Moreover, it is known that hard 
problems, e.g., factoring, can be solved in linear time [Sha79]. Restricting the size of 
the intermediate results to be polynomial in the size of the inputs ensures that strongly 


polynomial time is contained in polynomial time. 


Lower bounds techniques for computations that use the floor operation have been 
much less successful than for rational operations. Perhaps one reason for the lack of 
progress in this area is that this set of operations does not possess the “nice” algebraic 


properties that rational functions possess. 


Previously there have been a few lower bound techniques developed to handle the 
floor operation. In [BJM88], a computation tree that operates on real numbers and has 
the floor operation in its repertoire was studied. They show, using topological arguments, 
that there are certain classes of languages which can not be decided by analytic compu- 
tation trees. [JMWS89], compares the expressive power of computation trees with various 
sets of operations, and proves lower bounds for computation trees with the operations 
{+,-,DIV.}, where DIV, denotes integer division by constants. Based on proof tech- 
niques of [JMW89], [Bas90a] shows an 2(n/ log n) lower bound for computing the GCD, 
using {+,—,|-],xc,/c}, where x. and /. are multiplication and division by constants. 
(Note that in such a model the mod operation cannot be performed in O(1) operations.) 
The work of [IMR83] discusses the relation of the floor operation and indirect address- 
ing. In ([DO85, Jia79], an Q(nlogn) lower bound for sorting rational numbers, using a 


restricted floor operation, is given. 


The aim of our research is to establish lower bounds for computations that use the 
floor operation, in addition to rational operations. The main motivation for considering 
the floor operation, is that it is simple to implement in practice, and that it exists as a 


basic operation in most computers and programming languages. 
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the average error is less than ¢, requires at least (,/log log 4) operations. (In the lower 
bound we assume that the computation receives only the input z € [1, 2], as in the case of 
Newton iterations, and tries to produce the best approximation it can, given a bound on 
the number of operations.) In a somewhat different setting, we show how to approximate 
the s“ root, for any x € [1,2]. The approximated value, for any input z € [1,2], is at 
most € away from \/z, and the number of operations is O(,/loglog +). (In the upper 


bound, the computation receives both the input x € [1,2] and the accuracy parameter 
€.) 


All our lower bounds assume that we have to generate all the constants that are used 
in the computation explicitly. We show that without this restriction any polynomial 
could be computed in O(1) operations (independent of its degree). In order to avoid any 
possible confusion, we quantify our lower bounds by stating that the computation has 


initially only the constants {0,1}. 


The research in this work was done in collaboration with Baruch Schieber and Prasoon 


Tiwari, and preliminary versions of it appear in [MST88, MST89b, MST89al]. 


This thesis is organized as follows. In Chapter 2 we define the computation model 
and prove a few results concerning polynomials and rational functions. In Chapter 3 
we develop the proof technique for one variable functions. In Chapter 4 we extend the 
technique from one variable to two variables, and show the lower bound for computing 
the greatest common divisor of two integers. In Chapter 5 we show the lower bounds for 
approximation. In Chapter 6 we show the upper bounds. In Chapter 7 we summarize 


the results and suggest directions for future research. 


Chapter 2 


Preliminaries 


2.1 Computation Tree Model 


In this section, we define the computation tree model. Let us motivate the definition 
chosen. Consider a specific computer, it has some set of instructions that are built into 
it. A program that runs on that computer uses this set of operations, usually referred to 
as “machine instructions”. A natural measure of the time complexity of a program is the 
number of machine instructions that it performs. Although this approach assumes that 
each machine instruction requires the same amount of time, in many cases this gives a 


good approximation. 


This means that it is natural to fix some set of operations, and consider them as basic 
operations. The time complexity of computing a certain function is the number of basic 


operations performed. 


We would like a formal model that would make it “easy” to define time complexity. 
If the set of basic operations does not include a branching operation, then the number of 


operations performed, on any input, is simply the number of operations in the program. 
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Definition 1 A computation tree T consists of a labeled binary tree, and a set of 


operations OP. Each vertex of the tree is labeled in one of the following ways. 


e Input vertices: An input vertex is labeled by a certain input, e.g. x1. Each input 


vertex has at most one child. 


e Computation vertices: Each computation vertex u is labeled with an operation f, = 
0(v1,...,Uz), where o € OP, and v; is either a computation or an input vertex that 
appears on the path from the root to node u. Each computation vertex has at most 


one child. 


e Comparison vertices: Each comparison vertex u is labeled with v; < v2, where, 
again, v; is either a computation or an input vertex that appears on the path from 


the root to node u. Each comparison vertex has two children. 


Given an input the computation tree defines a computation. The computation tra- 
verses a path from the root to a leaf and assigns values to the input and computation 


vertices on the path in the following way. 


The computation starts at the root of the tree T, and proceeds as follows: 


1. When the computation arrives at an input vertex u, labeled by z;, it assigns u the 


value of the input z,;, and continues to the child of u. 


2. When the computation arrives at a computation vertex u, labeled by fy = op(v4,..., 
vz), it assigns u the value val(u), which is op(val(v,),..., val(vg)), where val(v;) is 
the value of vertex v;. Note, that since v; appears on the path from the root to u, 
the computation already assigned some value to v;. The computation continues to 


the child of wu. 


3. When the computation arrives at a comparison vertex u, labeled with v, < va, it 
proceeds to the left child of u, if val(v,) < val(v2), and to the right child of u, 


otherwise. (A comparison node does not have a value.) 
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2.2 Polynomials and rational expressions 


This section includes some basic definitions and results about polynomial and rational 


expressions. These results will be used extensively in the following chapters. 


The degree of a polynomial P(r1,...,2,%) with respect to a variable z,;, denoted 
deg,,(P), is the maximum exponent of z; appearing in any monomial of P(z1,...,2,). 


The degree of P, denoted deg(P), is max; deg,,(P). 


The size of the coefficients plays a crucial role in the lower bounds that we prove 
later. For this reason we add another parameter, which is the mazimum coefficient of P, 
denoted maz-coef(P). The value of maz-coef(P) is defined to be the maximum among 


the absolute values of the coefficients of P. 


We extend the notion of degree and maximum coefficient of a polynomial to a set of 
polynomials, in a rather straightforward way. For a set A of polynomials, the degree of 
A, denoted deg(A), is maxpea deg(P). Similarly, the mazimum coefficient of A, denoted 


maz-coef(A), is maxpe, maz-coef( P). 


The following lemma relates the degree and maximum coefficient of polynomials and 


the operations {+,—, *}. 


Lemma 2.1 Let P(2,,...,2%) and Q(x,,...,2,%) be two multivariate polynomials. 


Then, 


1. deg P + Q) < max{deg(P), deg(Q)}, 
2. maz-coef(P + Q) < maz-coef( P) + maz-coef(Q), 
3. deg(P *Q) < deg(P) + deg(Q), and 


4. maz-coef(P *Q) < (1+ min{deg(P), deg(Q)})*maz-coef{ P)maz-coef{Q). 
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A lexicographic order on polynomials 


We define a lexicographic order on the set of multivariate polynomials. For this purpose, 

we use the following lexicographic order on the set of multivariate monomials. 
Definition 2 For two monomials cx}! ...xj* and dx? Jat, the relation cr} ...rk 

> da? .. walk holds if either (1) there exists an m, such that im > jm, and for anyl <m, 


y= ji, or (2) = ji, for’ <1 <k, and |e| > {d. 


We say that a polynomial is written in its normal form if it is written as a minimal 
sum of monomials, and these monomials are sorted in descending lexicographic order. 


We assume that all polynomials are written in their normal form. 


Definition 3 Fortwo multivariate polynomials P(21,...,2%) and Q(21,...,2%), P(21, 
1225 2k) > Q(a1,...,2%) Uf, when written in their normal forms, there exists some i > 1, 
such that (the 7-th monomial in P) > (the i-th monomial in Q), and all the monomials 


preceding it are identical in both P and Q. 


Given a polynomial P(z1,...,z,), let the leading monomial of P be the first monomial 
in the normal form of P(x1,...,2,). Let the leading coefficient of P(x,,...,2%) be the 


coefficient of this monomial. Define 


+1 for a>0O 
sign(a) = 0 for a=0 


-—1 for a<0 


The sign of a polynomial P, denoted by sign(P), is the sign of the leading coefficient of 
P. Note that sign(P) = 0 if and only if P is the zero polynomial. A sign of a rational 


expression A(z) = aa, is sign(P) sign(Q). 
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To complete the proof we show that if b > 1.(P) = Mth then (1 — 7%, Ex) > 0, or 


equivalently, 77.4 Eel <1. This later inequality follows from 


el Mice ear 4 es GNP ME > Es 1 
ea Se < >> <> Lees a 
a bk tO oars Lb AML LM + Li a 
Similarly, we can prove the Lemma for the case L < 0. 0 


From Lemma 2.3 we can infer a sufficient condition for univariate polynomials. 


Corollary 2.4 For each polynomial P(y), there exist a positive integer 7(P) such 
that for allb > x(P), sign(P(b)) equals sign(P). Furthermore, 7(P) < maz-cochP) +1, 
where L is the leading coefficient of P. 


Corollary 2.4 enables to argue along the following lines. Consider a comparison be- 
tween Q(x) and P(x), where both Q and P are polynomials. By Corollary 2.4, there 
exists a value e = ™(P—Q), such that for x > e, either P(x)—Q(z) = 0, P(x)-—Q(ax) > 0, 
or P(x) — Q(x) < 0. This means that, for z > e, the value of the comparison between Q 
and P is fixed. In a similar way, Lemma 2.3 allows us to argue about the comparison of 


bivariate polynomials. 


Polynomial division 


The lemma below shows how the degree and the maximum coefficients of bivariate poly- 
nomials are affected, when we perform polynomial division. Note that the following 


lemma is restricted to the case that the leading monomial has only one variable. 


Lemma 2.5 Let P(z,y) and Q(z,y) be two bivariate polynomials with integer coeffi- 
cients. Let maz-coef{P) = M, maz-coef(Q) = N, and 6 = max{—1, deg,(P) — deg,(Q)}, 
If Lz* is the leading monomial of Q(2,y), where d > 1, then there exists A(z,y) and 
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Therefore we can bound the maximum coefficient of AR, using the inductive hypothesis, 


as follows, 


maz-coef(R) < (2 + deg,(Q))*’maz-coef(S)N° 


IA 


(2 + degy(Q))® ((2 + deg,(Q)) MN) N° 


(2+ deg,(Q))°t! MNO*, 


lA 


Furthermore, deg,(f) < deg,(Q){= d}, and deg,(R) < deg,(S) + (6 — 1)deg,(Q) 
S deg,(P) + 6'deg,(Q). 


In a similar way the bounds on maz-coef{A) and deg(A) follow. Oo 


For most of our applications the following simplified version of Lemma 2.5 would be 


sufficient. 


Corollary 2.6 Let P(z,y) and Q(z,y) be two bivariate polynomials such that deg(Q), 
deg(P) < D, and maz-coef( P), maz-coef(Q) < M. If Lx* is the leading monomial of of 
Q(z,y), then there exists A(x,y) and R(z,y) such that 


1 1 
P(z,y) = pu Ale, )Q(z,y) + pn Mle.) 


where 6 = max{—l, deg,(P) — deg,(Q)} < D. Furthermore, A(z,y) and R(z,y) are 


polynomials with integer coefficients, that satisfy, 


1. maz-coef(R) < ((D + 2)M)P*?, 
2. maz-coef(A) < ((D + 2)M)?*}, 
3. deg,(R) < D, and deg,(R) < (D +1)D. 


4. deg,(A) < D, and deg,(A) < D. 


Chapter 3 


One Variable Functions 


3.1 Overview 


Proving that a polynomial cannot compute a certain boolean function, using only a 
certain number of {+,—,*} operations, can be done by arguing about the degree of 
the polynomial. In order to make the case more interesting, assume that we relax the 
requirement on the output of the polynomial such that it evaluates to a non-negative 
number if and only if the computed function is TRUE on that input. On the one hand, 
the number of alternations between TRUE and FALSE of the output of the computed 
function gives a lower bound on the degree of the polynomial. On the other hand, in order 
to evaluate a polynomial of degree d, one must perform at least logd multiplications. 
This degree argument is independent of the size of the coefficients of the polynomial, 
and applies even if the input is restricted to be integer. The degree argument can be 
extended also to rational functions, and to computation tree with rational operations 
(i.e. {+,—,*,/}). Stockmeyer [Sto76] showed, using a similar argument, an 2(n) lower 
bound for the depth of any computation tree that computes the parity, for any n bit 


integer, using rational operations. 
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Doing it in a straightforward way may cause a “huge” increase in the degree. 


For example, consider the expression lg 


ZI, where zx is an n bit integer. If R is a 


rational function such that R(x) = Be then R has degree 2”~1. We are interested in a 
transformation that keeps the degree and the coefficients “small”. In this specific case, 
a good solution would be to set [z| = 0, and restrict x to integers that are divisible by 


2, i.e. only even integers. 


The general solution has a similar flavor. We restrict the input, so we can set the 
value of the floor operation equal to a rational function. In this transformation, the 
degree remains the same, and the size of the coefficients may grow at most exponentially 
in the degree. Based on this technique, we show how to derive lower bounds for a decision 


tree. 


In Section 3.3 we show how to derive lower bound for various functions, using the 
general technique. Although our proof technique is for decision problem we can use it 
to derive lower bounds for computation problems as well. In the cases that we consider, 
the decision problem is the parity of the output of the computation problem. Since the 
floor is a basic operation, we can compute the parity from the output in O(1). This 
implies that for such decision problems, the computation problem is at least as hard as 


the decision problem. 


3.2 Proof Technique 


We start by defining the subsets of the input that will be used in our proof. 


Definition 4 Let the set S(n,X) be the set of all n-bit integers that are multiples of 


an integer X. 
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The interesting case of the proof technique is the floor operation. This is by far more 
involved than the previous cases. After proving this lemma, we show how to combine 
the techniques we developed to derive a lower bound for decision trees. Recall that we 


restrict the floor operation to non-negative inputs. 


Lemma 3.4 Let P be a rational expressions with integer coefficients, of degree at 
most D and maximum coefficient at most M, defined over S(n,X), such that P(x) > 0 
for any x € S(n,A). There there exists t < (2M)?*?, and a rational expression Q with 
integer coefficients, where deg(Q) < D and maz-coef(Q) < (2M)?+', such that for any 
2 € S(n, An), Q(z) = [P(2)]. 


Proof: If sign(P) = 0, i.e. P is the zero function, then let Q(x) = 0/1 and 7 = 1, 
and the lemma follows. Otherwise, let P(r) = ot Without loss of generality we may 
assume that szgn(P2) = +1. We would like to restrict our attention to a subset of the 
inputs, for which the sign of P(x) is fixed. By Corollary 2.4 for z > max{m(P,), 7(P2)}, 
sign(P(x)) equals sign(P), which equals sign(P,)sign(P2). Since the operand of the floor 


is non-negative, sign(P) = +1, which implies that sign(P,) = +1. 
We consider three cases of relations between P, and P». 


Case 1: P\(x) = P(x). This is the trivial case. Let Q(x) = 1/1, and the lemma follows. 


Case 2: P\(x) ~ P2(x). Let B(x) = P(x) — P(x). Since the leading coefficient of Py» is 
positive then szgn(B) = +1. Corollary 2.4 guarantees the existence of a positive integer 
m™(B) such that B(a) > 0, for all a > max{m(P,),7(P2),7(B)} = 7. Since P2(x) > 0, 
the expression P;(x) — P,(x) > 0 is equivalent to 1 > #4 = P(z). Since P(z) > 0, 
for zg € S(n,i), 1 > aig} > 0, for a € S(n, Am). We conclude that for any integer 


a € S(n,An), |P(a)| = 0. Let Q(x) = 0/1. Since r < 2(D + 1)M? +1, the lemma 


follows. 


3.2. PROOF TECHNIQUE 33 


We need to bound the parameters 7, deg(Q) and maz-coef(Q). To bound deg(Q) 
and maz-coef(Q) it is sufficient to consider the polynomial A(x) + cL*. Clearly, 
deg( A(x) + cL*) < D. The value of maz-coef(Q) is the maximum between L? and 
maz-coef( A(x) + cL). By definition, maz-coef{ A(x) + cL) < maz-coef( A) + L’. 
Using Corollary 2.7, maz-coef(A) <2?M?t!. Therefore maz-coef(Q) < (2M)?*}. 


To bound the value of 7, we bound 7(B); the bounds for m(yP2 + R), (Pi), 
and m(P)) are smaller, and derived similarly. Recall that L¢B(z) = L*P,(x) -— 
(yP2(z)+R(z)). Therefore, maz-coef(L4B) < (L*—~) maz-coef( P2) + maz-coef( R) 
< M?+1M + 2P+1\f+2, Since L > 0, for any z, sign(B(x)) = sign(L*B(z)). 
Hence, in order to bound 7(B) it is sufficient to bound 7(L7B). Since L7B(z) is a 
polynomial with integer coefficients, its leading coefficient is at least one. Therefore, 
by Corollary 2.4, 7(L?B) < maz-coef(L7B) +1. By venue that m is divisible by 
L*, we may add L4 — 1 to 7(B), therefore, r < #(L4B)+1+(L4—1) < (2M)?*?. 


Subcase 2: y = 0. Clearly R(x) ~ L*P,(x). Using Corollary 2.4, let 7 be the 
minimum multiple of ZL? such that > max {(R), t(P,), t(P2), t(L7P2 + R)}. 
Then, for all a > ™, prg@> = 0, if sign(R) = 0,0 < rey <1, if sign(R) = 41, 
and —1 < pasty < 0, if sign(R) = —1. Recall that the free term of A is L’c + . 
Let ¢ be c if sign(R) > 0, and c — 1 if sign(R) = —1. We conclude that for each 
a € S(n, An), | i | = L-4A(a) + @. Let 


= |P(x)| for z € S(n, Az). 
We can bound the parameters 7, deg(Q) and maz-coef(Q) as in the previous subcase. 


Since one of the above relations has to hold between P; and P:, the lemma follows. O 


To summarize, so far we considered each operation by itself. Lemma 3.4 guarantees 


that each floor operation can be replaced by an evaluation of a rational expression. 
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3. Let ©; = {F.,,G., | vj is a computation vertex, j <i}. Define D; = deg(X;) +1 


and M; = maz-coef(¥;). Then, Dj < 2', and max{X, Mj} < gat? 


We show how to define A+?) and how to choose the vertex vj42 such that Properties 
1-3 will be satisfied by the set S(n, A“+)) and the prefix of P that starts at v,;, and ends 
at viz2. The parameter A+”) will be a multiple of \®. By Claim 3.1 this implies that 
S(n, AS) © S(n, A“), therefore, by the induction hypothesis we have that Properties 
1-3 are satisfied by the set S(n,A+!)) and the prefix of P that starts at v; and ends 
at viz1. In order to complete the proof of the lemma we need to show that (a) there 
exists an outgoing edge of v;4; such that for each input a € S(n,A\°+)) the computation 
follows this edge, and (b) Properties 2-3 are satisfied also for the vertex v;,1 and the set 


S(n, MAN), 


By the definition of the tree T, the vertex v;4; is either an input vertex, a comparison 


vertex or a computation vertex. For an input vertex both (a) and (b) hold trivially. 


For a comparison vertex we need to show that (a) holds. Let the comparison be 
g = h, such that g,h € {0,1} U{f,, | vj; is a computation vertex, 7 < 7}. By the 
induction hypothesis both h and g can be represented as rational expressions of degree 
at most D; < 2', and maximum coefficient M; < ga By Lemma 3.2, there is a 


AED << 2AM (D; + 1)M? + AO) < gnu For a comparison vertex (b) holds trivially. 


For a computation vertex (a) holds trivially, so we need to show only that (b) holds. At 
a computation vertex, either f,,,, = goh, foro € {+,—,*,/}, or fu,,, = Lg| is evaluated, 
where, g,h € {0,1} U{f,, | v; is a computation vertex, j < i}. Ifo € {+,-,*,/}, 
then the induction step follows from Lemma 3.3. If f,,,, = |g], then, by Lemma 3.4, 
there is a rational expression Q, such that Q = |g]. By Lemma 3.4, Diy: < D; < 2', 
Mist < (2M;)241 < o(1+24” )(2'+1) < gate? and A+) < \O(2M,) P+? < pH? og 
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The following corollary states that for M(n) = (2°), the lower bound is still 2(./logn). 


This corollary is used later to derive an 2(./logn) lower bound for various functions. 


Corollary 3.7 Let f be an M(n)-invariant function, such that M(n) = Q(2™) for 
some fired € > 0. Then any computation tree with OP = {+,—,*,/,|-|}, that computes 
f(z), for all n-bit integers, must have depth 2(/logn). 


Using Corollary 3.7 we show lower bounds for the following problems. 


1. Decide if [log a| is odd or even, for any n-bit integer a. (choose M(n) = 2"7! 


a, = \ and az = 2A.) 


bs 


2. Decide if |logloga| is odd or even, for any n-bit integer a. (choose M(n) = 2"/?, 


a, = A and a2 = A?.) 


3. Decide if \/a is an integer. (choose M(n) = 2"/?-1, a, = \? and az = 2°.) 


Theorem 3.6 gives an 10(,/log n) lower bound on the depth of any decision tree with 
OP = {+,-,*,/,|-|}, that solves the above problems. 


At first sight it seems that all of the above lower bounds are for decision problems, 
and may be unrelated to the similar computation problems. In general this may be the 
case, but our examples have the property that the decision problem is closely related to 
the corresponding computational problem. For all the above decision problems, given a 
solution to the corresponding computation problem, one can decide the decision problem 


in O(1) operations. 


Chapter 4 


Lower bound for GCD 


4.1 Overview 


In this chapter we extend the proof technique developed in Chapter 3 to functions with 
two variables and apply it to show a lower bound for computing the greatest common 
divisor of two integers. Although this chapter does not depend on chapter 3, it is advisable 


that the reader would read chapter 3 prior to reading this chapter. 


Before explaining the lower bound technique for functions of two variables, we argue 
why the technique for one variable does not extend immediately to two variables. The 
most obvious reduction would be to concatenate the two inputs, each of size n bits, to 
one input of size 2n bits, and define the function with respect to this single variable. In 
order for the reduction to work, we need to argue that given this input, we first extract 
the two n bit integers, and continue the computation using them. The problem is that 
in order to extract the two integers one may be required to perform Q(log n) operations. 
This Q(log n) additive factor would be more than the lower bound that we can prove for 


the one variable function. 
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4.2 Inputs structure 


We start by defining the subsets of inputs that we will consider in the proof. Recall that 
for the one variable case, the subsets of the input that we considered were all the integers 
divisible by some integer. Unfortunately, the structure of the subsets of the inputs is 


substantially more complex in the two variable case. 


Definition 6 Letr, a2 and az be non-negative integers, and a; a positive integer. Let 
A = (61, 62,...,6,.) and A = (Xj, A2,...,A,) be r-dimensional vectors of positive integers. 


For positive integers uo, ui, ur, and ur41, the pair (uo, ui) is < 7, a1,02,03,A,A >- 


generated by the pair (u,,u,41) if there exist positive integers u2,uU3,...,Ur—1 such that: 
Uu = dy (uy)! + U2; (4.1) 
uy = Ag(u2)? + us, 
Uuo= Nigr (Wig) + + Ui42; 
Ur-p = Ny (tr)™ + Urdi; 
tie > dae (4.2) 


Ur¢1 > C2, 


Ur¢1 =1 (mod as). (4.3) 


Up 


In this case, (u,,Ur41) is the < r,a1,a2,a3,A,A >-generator of (uo,u,), and wu, us, 


-++, Urq is the <r,a1,Q2,a3, A, A>-generating sequence for (uo, u1). 
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The following lemma shows that when we add another variable, and thus adding 
another element to A and A, we restrict the “allowable” inputs to be a subset of the 


previous “allowable” inputs. 


Lemma 4.2 (The Containment Property): Let A’ and A’ be (r+1)-dimensional 
vectors of positive integers, obtained from r-dimensional vectors A and A by appending 
positive integers 6 and X, respectively. Then, S(r+1, a1, a2, a3, A’, A’) C S(r, a1, a2,03,4, 
A), provided 6 >a, and} =0 (mod az). 


Proof: Suppose that (ag,a1) € S(r + 1,a1, a2, a3, A’, A’), and let a2 > a3 >... >a, > 
Gr41 > Gry be its generating sequence. By definition, a, = rae, + a,42. Therefore, 
a, = 1 (modas), and a, > afi,. In addition, a,41 > ay}, > ag. Hence, (ao,a1) € 


S(r, Q1, @2,a3, A, A). O 


4.3 The proof technique 


As in the proof technique for one variable, we start by showing how to handle a compar- 
ison. We show that we can restrict the inputs to the comparison, so as to ensure that 


the value of the comparison is fixed. 


Lemma 4.3 Let P(z,y) and Q(z,y) be two rational expressions with integer coeffi- 
cients, of degree at most D and mazimum coefficient at most M, defined over S(0, ay, a2, 
a3). There exists a, < a < max{a1,2D+1}, and a2 < a4 < max{a2,2(D+1)?M?+1}, 
such that for all (a,b) € S(0,a),04,a3), the comparison between P(a,b) and Q(a,b) is 
determined (i.e. either P(a,b) = Q(a,b), P(a,b) > Q(a,6), or P(a,b) < Q(a,d), for 
(a, 6) € S(0, a4, a4,a3)), and S(0,a4, 04,03) C S(0,a1, a2, a3). 


Proof: If P(z,y) = Q(z,y), for (z,y) € S(0,a1, a2, a3), then let af = a, and a} = a2, 


and the lemma follows. 
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1. There is a rational expression Q(z,y) with integer coefficients, and integers a, a, 
aw such that 
[P(z,y)] = Q(z,y) 


for each (x,y) € S(0,a4,a5,7a3). Furthermore, max{a}, deg(Q), } < 4D*, and 
max{ mas, a, maz-coef(Q), } < 2(4D?M)?", or 


2. There is a rational expression Q(y,z) with integer coefficients, and integers a}, ay, 
mw, A\ and 6 such that 
[P(z,y)] = Q(y, 2) 


where (y,z) is the generator of (x,y) € S(1, a4, a4, ta3, {6}, {A}). Furthermore, 
max{ai, deg(Q), 6} < 4D*, and max{ras, a’, maz-coef(Q), 4} < 2(4D?M)2™, 


Proof: Let d, = deg,(P2), dy = deg,(P2), and let Lar**y' be the leading monomial of 
P,(z,y). Without loss of generality we may assume that L > 0, i.e. sign(P2) = +1. 


In order to simplify the proof, we would like to fix the inputs such that sign(P(z,y)) = 
sign(P). By Lemma 2.3, there are two integers 7(P,) and 72(P,), such that for each 
(u,v) € S(O, m1(P1), m2(P1), 1), sign(Pi(u,v)) = sign(Pi). By a similar argument there 
are ™(P2) and 72(P2) such that for each (u,v) € S(0, 71(P2), 72(P2), 1), sign( P2(u, v)) 
= sign(P,;) = +1. Since the input to the floor operation is non-negative, sign(P) = 
+1. Recall that szgn(P) = sign(P,)sign(P2); this implies that sign(P,) = +1. Let 
@, = max{ay,71(P,),71(P2)} and & = max{az,72(P,),72(P2)}. Note that for each 
u,v) € S(0,d1, 42,1), both Pi(u,v) > 0 and P2(u,v) > 0. Since the coefficients of P, 
and P» are integers, by Lemma 2.3, a3 < D+1anda2<M+1. 


In the remainder of the proof we consider the following five cases in the following 


order: 


i: Pi(z,y) = P2(2,y). 


Re Pi(z,y) ~< P2(z,y). 
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and ¥ is a non-negative integer such that 0< y < L. Let r = L, a = dy, and ay = dp. 


Then, for each (u,v) € $(0, a}, a, Ta3), 
P\(u,v) mod L = P,(1,1) mod L=7, 


Since u =v =1 mod L. By writing the mod operation explicitly, we have, 


P,(u,v) = Ao L+y 


Therefore, for (x,y) € S(0, a}, a5, 7a3), 


=, Q(z,y) 


Ea _ Prue) —7 


L L 


Clearly, deg(Q) < D, maz-coef(Q) <2M,andr=L<M. 


Case 4: The leading monomial of P,(z,y) is Lx*, i.e., no power of y appears in the 
leading monomial of P2(z,y). We use Corollary 2.6 to divide P,(z,y) by P2(z,y) as 
polynomials in z. By Corollary 2.6 there are polynomials A(z,y) and R(z,y) with 


integer coefficients such that 


A(z, y) Po(z,y) + R(z,y) 


Pi(z,y) = L4 


Note that since Case 3 does not hold, deg,(P2) > 1. Furthermore, the following properties 
hold: (i) d < D, (ii) deg,(R) < deg,(P2). (iii) maz-coef(R) < ((D + 2)M)?+?, (iv) 
maz-coef(A) < ((D + 2)M)?*1, (v) deg,(R) < D, and deg,(R) < (D+1)D. (vi) 
deg,(A) < D, and deg,(A) < D. 
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Consider input (u,v) € S(0,a4,@4, a3). Then, both V(u,v) < P)(u,v) and 0 < 
V(u,v). Since P2(u,v) > 0, this implies that, 
V(u,v) — yPo(u,v) + Ru, v) 


OS Rien) ae PiGa) 


We conclude that for each (x,y) € S(0, a}, a4, a3), 


Ta = Q(z, y). 


[P(z,y)] = 
Clearly, deg(Q) = deg(A) < D, and maz-coef(Q) < maz-coef(A) + M4? < ((2+ 
D)M)?*! + M®? < 2((D + 2)M)?t?, 


Subcase 4.2: y = 0. In this subcase we restrict the input, such that the value of 


| Ae Re) | is either 0 or —1, depending on sign(R). Define R(x, y) as follows: 


u,U 


Reis R(x, y) if sign(R) > 0 
L¢P,(z,y)+ R(z,y) if sign(R) = -1 
Note that, . 
R(z,y) _| R(z,y) 
ae | . Ae 7 Faas . 


where p(R) = 0, if sign(R) > 0, and p(R) = —1, if stgn(R) = —1. The leading 
coefficients of the polynomials R(x,y) and L4P;(x,y) — R(z,y) are positive, assum- 
ing that sign(R) 4 0 (the case sign(R) = 0 is trivial). Recall that = L*. Using 


Lemma 2.3, let 


a4 = max{d1, ™1(R), 7 (L7P, = R)} < D(D + 1) 


AN 
S 
Ee 
aa 
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We substitute z by Ay’ + z. Therefore, deg(Q) < D(D +1) = D and maz-coef(Q) 
< \P2P Mu < 22MP+1 = M. By Using Case 4, we get a rational expression Q’(y, z), and 
a’, a and m, such that for each (x,y) € S(1, a}, a, mas, {6}, {A}), 


Q'(y,z) = [Q(y,2)| = LP(z,y)] 


where (y,z) is the (unique) generator of (x,y). Now substitute z = x — Ay® in Q'(y, 2) 


to get Q(z,y). We compute the following bounds for Q, 


max{deg(Q),ai} < (D1) 


IA 


(D(D +1) +1)? 


IA 


4D%, 


and 


max{maz-coef(Q),a5,7a3} < 2 ((D + 2)M) pee 
< 2((D(D +1) +2)(22MP4))" 
< 2(4D2M)”", 


Since the relation between P, and P2 belongs to one of the five cases, this concludes 


the proof of this lemma. 0 


The following lemma combines the previous lemmas into a proof technique for the two 
variable case. This lemma shows, that given a computation tree, one can find a subset, 
S(r, @1, @2, a3, A, A), such that all the inputs from this subset follow the same path. Once 
we establish this property, we can derive a lower bound for any boolean function that, 
for any such subset, has at least two elements which differ in their outcome. In the next 


section we show that deciding if two integers are relatively prime is one such function. 
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1. For each input (a,b) € S® the computation follows the path from the root to vj41; 


2. For each computation vertex v on the path from the root to the vertex vj41, ex- 
cluding the vertex v;41, there is a pair of bivariate polynomials (F'(z,y), Gi(z, y)) 
with integer coefficients, such that for each input (a,b) € S®, Gi(u,v) # 0, and 


val(v) = aes, where (u,v) is the < ral) af af AO A >-generator of 


(a, 6); and 


3. Let L; = (Fi) Gey) | 7 < ti}. Define D; = max{deg(X;),4} and M; = 
max{ maz-coef(;),4}. Then, rf <3, max{a), D;} < 27" and max{a$), af), M;} 


<2? 


We construct the set S@+!) C SW such that Property 2 is satisfied also for the vertex 
vi41 and each input (a,b) € S“+)), We also select an outgoing edge of v;4; and prove 
that for each input (a,b) € S“+") the computation follows this edge. It is easy to check 
that r+) <7+1. Therefore, in order to complete this proof, it is sufficient to show that 


the following two inequalities hold: 


(i) max{al't)) | Diag} < py 


7 . 4(1+1) 
(ii) max{a$t? aft) Mii} <2. 


Let us first resolve the case when v;4; is a comparison vertex. Then by Lemma 4.3, we 
can extend P. Furthermore, afi) < 2D; +1 <2" and at) < 2M?(D;+1)?+1< 


4(141) 
ge Clearly, inequalities (i) and (ii) hold in this case. 


Next, consider the case when v;4; is a computation vertex. We divide the discussion 
to two cases. The first case is when the operation is a rational operation, and the second 


case is when the operation is floor. 


Suppose v;41 is vo 4, where v and p are previous computation vertices, and o € 


{+,—,*,/}. By the induction hypothesis, val(v) = ey and val(u) = ees where 
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(ap, a1) = ((1 + ag3)*+%, (1 + a)*), and (bo, b:) = (1 + a3(1 + a3)FV, (1 +.a3)*). In 
order to see that (ao,a1) € $(0,a1,a2,a3), one can verify that (1) ao > aj’, (2) a1 = 
(1+a3)* > a2, and ag = ay = 1 mod az. (A similar argument holds for (bo, d:).) Clearly, 


gcd(ao, a1) = (1 + a3)* £1, and ged(bo, b1) = 1. 


In order complete this lemma, it is sufficient to prove each of the numbers apo, 41, bo, 
and }, is less than 27('+1), Since e is the least exponent such that (1 + a3)* > ao, 
then (1 + a3)*° < 27°. The desired upper bounds are an immediate consequence of this 


observation. Oo 


Intuitively, the previous lemma shows, that if we terminate with a large subset S(-), 


there is still a pair of inputs, one of which is misclassified. 


Theorem 4.8 Any decision tree with OP = {+,—,*,/,|-|} and constants {0,1}, 
that decides if a and 6 are relatively prime, for all integers 2" > a > b > 0, must have 
depth O(log log n). 


Proof: Without lost of generality we may assume that the leaves are labeled by the 
constants zero and one, and that the two first vertices are input vertices. (This can 
increase the depth of the tree by at most four.) Suppose that we are given a decision 
tree T of depth h < }loglog(n!/*), with OP = {+,-,*,/,|-|}, that decides if a and 
6 are relatively prime, for all integers 2” > a > 6 > 0. By Lemma 4.6 we have the 
following: (i) there is a path P from the root of T to a leaf v, and a set of inputs 
S = S(r,a1,a2,a3,A,A)M {(a,6) : 0 < a,b < 2"}, such that for each input (a,b) € S, 
the computation follows the path P; (ii) The leaf v is labeled by a constant (zero or 


1/5 


aid 4h 4h 
one). (iii) ay < 2%" < n™/5, a, a3 < 2”) < 2" maz-coef{ #14) < 2", and 


Gy(z,y) 
deg( He4 ) <n'/5_ Our goal is to arrive at a contradiction using Lemma 4.7. 


Towards this end, let t = n!/5 — 1. We claim that each pair (u,v) € S(0,a1, a2,03)M 
{(u,v) : 1 < u,v < 27(41)} generates a pair (a,b) € S = S(r, a1, a2, a3, A, A) = 
S(r, 1, 42,a3, A,A)M {(a,b) : 0 < u,v < 2"}. Recall that the first two vertices of the 
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Approximation of Real Functions 


5.1 Motivation 


It is not hard to show that the square root function cannot be computed using ratio- 
nal operations only. For example 2 is not a rational number; therefore it cannot be 
represented by rational operations on {0,1,2}. The floor operation would not resolve 
this problem, since its output is always an integer. Still, we would like to argue about 


computations that involve such functions. 


One direction of research may be to assume that the square root function is one of our 
basic operations in the computation tree, and consider the complexity of computing other 
functions, given it as a basic operation. Another direction is to relax the requirement on 
the output of the computation tree. Instead of requiring it to be equal to the computed 


function, we can require it to “approximate” the computed function. 


The first approach, where the square root is consider as a basic operation, was in- 
vestigated in [SY76, Pip81, Ja’81]. The main complexity measure that they considered 


was the number of square root operations that have to be performed. It was shown, for 
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The norm L,, is defined as, 
Il flloo = sup |f(2)| 
reA 


It is well known that the norm of a function || f||, is monotonically non-decreasing in the 
parameter k, i.e. || fll < |[fll2 < +--+ < || fll. This implies that showing a lower bound 


for || f||1, implies a lower bound for any Ly, including Leo. 


We still need to make a connection between the norm of a function and the quality 
of an approximation. Given a certain point 2, the difference between g(x) and A(z) is 
a measure of how well g approximates A at that point. This suggests that the norm of 
g —h would be a good indicator of how well g approximates h. More formally, we define 


a function error;(g,h) as follows, 


errors(g, h) = [lg - Alle = ¥f f lola) — A(x) [Ade 


The function error;,(g,h) is how well g approximates h in norm Ly. A function g(z) 


approximates a function h(x), within e, in norm Lg, if error(g,h)z < €. 


The more intuitive norms are probably L.. and L, (although norm Lz plays an im- 
portant role in many applications). The norm L. is intuitively the worst case, for 
approximation it would mean the worst case approximation, over any input. An € ap- 
proximation in norm L,, implies that for any input in A, the difference between the two 
functions is at most ¢. The other norm is £1, which intuitively captures the average case. 
In the case of approximation one can view it as the expected error for a randomly chosen 


input. 


In this chapter we show a technique to derive lower bounds for approximation in 
norm L,. Clearly any lower bound in norm Ly, implies a lower bound in any other norm, 


including Lo. 
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path, P, in the computation tree. Note that this would not be true for an integer input, 
since, for example, branching according to whether the input is odd or even, divides the 
integers to two sets, such that neither set has two successive integers. The other point is 
that we need / to be “large”, otherwise we will not be able to establish a lower bound at 
the end. For example, if / < €, then the square root function can be approximated by a 


constant over such an interval. 


The proof that there exists a path P is done by constructing it inductively. The 
inductive claim uses a prefix P; of P, that includes the first 7 vertices, and a subinterval 
[a;,a; + /;], such that the value of each vertex in P;, for inputs from [a;,a; + J;], can be 


expressed as a rational expression, and any input from [a;, a; + /;] follows P;. 


In the inductive step we show that P; can be extended to P;4; and maintain the 


inductive claim. Here we have two cases, depending on the type of last vertex of P;. 


If the last vertex in P; is a comparison vertex, we need to show how to choose one 
of its sons as the next vertex v;4,. In this case we need to show that there is a “large” 


subinterval [a;41, ai41 + 4:41], where all the inputs continue to vi41. 


The other case is when the next vertex on P is a computation vertex. If the computa- 
tion vertex has a rational operation, then the inductive claim is maintained trivially. The 
interesting case is when the vertex computes the floor function. Such a vertex receives as 
an argument the value of some previous vertex on P;, which by the induction hypothesis 
can be expressed as a rational expression. Using the above argument about |r(z)|, there 
is a “large” subinterval [a;41, a:41 + li41], in which the value of this vertex is constant. 
We continue by setting the value of the vertex to that constant, and restrict the inputs 


to the subinterval [aj41, a:41 + liga]. 


To summarize, the construction so far shows that for any computation tree J’, defined 
over inputs x € [1,2], there is a subinterval {a,a+/], and a rational expression r(x), such 
that T(x) = r(x) for any x € [a,a +1]. This by itself does not imply any lower bound 
but reduces the problem of proving the lower bound for T, over [1,2], to the problem of 
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lemma shows that the “penalty” of restricting the rational function to a subinterval in 


which it has no roots or poles is rather limited. 


Lemma 5.2 Given an interval [a,a + 1] and a rational function r(x) of degree d, 
there exists a subinterval [a,a+ 4] C [a,a+ I] that does not contain any poles or zeros 


of r(x), and furthermore, r(x) is monotone in this subinterval. 


Proof: Let r(z) = Bey where p(x) and q(z) are polynomial of degree at most d. Consider 
the 6d + 1 subintervals [a,a + sari} (a + sani 4 + Pret eee .., {a+ aga +1]. In each 
subinterval in which r(x) has a pole, the polynomial g(x) has a zero. In the worst case, 
each zero of q(x) can be at the boundary of two adjacent subintervals. Since q(z) has 
at most d zeros, the number of subintervals that contain a pole of r(x) is at most 2d. 
Similarly, the number of intervals that contain a zero of r(x) (which is a zero of p()) is 


bounded by 2d. Therefore, there are k > 2d +1 intervals that do not contain any pole 


or zero of r(z). 


Now, consider these remaining k intervals. Each interval where r(x) is not monotone 
contains either a local maxima or a local minima in its interior. At each local maxima 
or local minima, the derivative of r’/(x) has a zero. Since the degree of r’(x) is at most 
2d, it can have at most 2d zeros. (Notice that the poles of r’(x) are same as the poles of 
r(z).) Hence, there exist k — 2d > 1 subintervals that do not contain any poles or zeros 


of r(x), and furthermore, r(x) is monotone in each of these subintervals. QO 


The following lemma is a classical result, known as the Markoff inequality. It relates 


the value of the derivative of a polynomial to the polynomial value and degree. 


Lemma 5.3 (Markoff [Che66, page 91]) Let p(x) be a polynomial of degree d, and 
p'(x) the derivative of p(x) with respect to x. Then 


max {ip'(2)|} S@ max {[p(2)}. 
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on polynomials, we can also bound the value of a rational expressions. Using the above 


lemma, we can derive the following lower bound for a polynomial. 


Lemma 5.6 Given an interval [a,a +1] and a polynomial p(x) of degree d > 1 with 
integer coefficients, there exists a subinterval [a,a+ ral Cc [a,a+1] such that (i) p(x) is 


monotone in this subinterval, and (ii) |p(x)| > (4)4 for all z € [a,a+ se}. 


Proof: Let 5 be the leading coefficient of p. Since d > 1 then 6 0, furthermore, since 
b is an integer, this implies that |b] > 1. Assume that K is the largest value of |p(z)| in 
the interval [a,a + /]. First, we argue that there is a subinterval of length 6 = at such 
that p(x) is monotone and is greater than K/2, throughout this subinterval. Then, we 


show that K/2 > (1/4). 


Let ys = a+ gy fori = 0,1,...,8d°. Consider the 8d° subintervals [yi-1, yi], for 
i = 1,2,...,8d°. Choose zo € [a,a +l] such that |p(zo)| = K. Choose 7 such that 
to € (yj, ¥;41]. Assume that j < 4d°, and consider d consecutive subintervals [y;4:, y}4:41] 
for 1 = 0,1,...,d —1. (The case 7 > 4d° is similar.) Since the polynomial p(zr) has a 
total of at most d— 1 local maxima and local minima, p(x) is monotone in one of these 


d subintervals. 


Next, we show that |p(z)| > K/2 for all z € [y;,yj4a]. For all real wu such that 
tot+u€ [a,a+1], Lemma 5.4 implies that |p(zo + u)| > |p(zo)| — WE K, Therefore, for 
all x € [y;, yj+a], [p(x)| > K — HK = K(1-4)> 4K. 


It remains to show now that K/2 is larger than (+). In order to prove this assertion, 
transform the polynomial p(x) into a polynomial q(z) with leading coefficient one, in the 


interval [—1,1] by setting 


9 
ee je 


“= 


2 
7 ), where 6 is the leading coefficient of p(x). 


a2) = + 
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exists a subinterval [a,a+ <q], such that |r(x)| > (2M)~1(2)~4 for all z € [a,a+ rae 


and r(x) is monotone in this subinterval. 


The following lemma is the main technique that we use to bound the value of the 
floor function. This lemma guarantees that there is a “large” subinterval in which the 
value of the floor function is constant. This implies that if we restrict our attention only 


to this subinterval, the value of the floor operation is constant. 


Lemma 5.9 Given an interval [a,a +l], where 1 < a < 2 andO0 <1 < 2-a, 
and a rational function r(x) of degree d with integer coefficients and maz-coef(r) < M; 
there exists a such that |r(x)| equals a fired integer J for all z € [a,a + M7}(pl)4*"] 
Cc la,at+U, where p < 1 is a fired constant that can be chosen to be 0.02. Moreover, 
lJ] < M(ul)~4. 


Proof: By Lemma 5.7, there exists a subinterval [@,@+ el in which r(x) is monotone, 
and |r(x)| < 2M(#)? for all x € [8,8 + <4;]. This implies that [r(z)| takes on at most 
4M(2)4 distinct integer values in this subinterval. By the pigeon-hole principle, there is 
a subinterval of length at least sera where r(z) is a constant. Let J equal the value 


of r(z) in that interval. Oo 


So far we were interested in the behavior of rational functions and the floor function. 
The next lemma shows a property of the s root function. This lemma establishes a 
connection between the degree of a rational expression, its maximum coefficient, and how 


well it approximates the s root. 


We should note that one can bound the approximation as a function of the degree, 
independent of the size of the maximum coefficient. The techniques used to achieve such 


bounds are more involved, and are not needed in our case. (See, for example, [Bra86].) 


Lemma 5.10 For an integer s > 2, given an interval [a,a+l] C [1,2] wherel <a <2 


and 0 <1<2-—a, and a rational function r(x) of degree d with integer coefficients and 
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2. For each computation vertex y on the path P, there is a rational function r,(z), 
with integer coefficients, such that for any input a € [a,a+/], the value computed 


at vertex v equals r,(q@). 


Finally, we conclude the desired lower bound on the length of P (and hence, a lower 
bound on the depth of 7’) by applying the result of Lemma 5.10 to rational functions 
that approximate \/z over the interval [a,a + J]. 


Denote the vertices on the path P from the root of T to its leaf wy by v1, v2,..., U1, 
in that order, where v; is the root of the tree T and v; is a child of v;_1. We define the 
path P and the subinterval inductively, starting with the empty path and the interval 
[1,2]. As part of the induction hypothesis, we maintain three properties of the path and 


the interval under consideration. These properties are described below. 


Suppose that (a) we have selected a prefix of P, which starts at v,, and ends at a 


vertex vj41, and (b) defined a subinterval [a;, a; + /;] with the following properties: 


1. For each input zx € [a;,a; + J;] the computation follows the path from the root to 
Vi41s 

2. For each computation vertex v; on the path from the root to the vertex v,41, 
excluding the vertex vi41, there is a rational function r;(x), with integer coefficients, 
such that val(v;) = r;(x), for any input z € [a;, a; + 4], 

3. Let L; = {rj(x)|v; is a computation vertex, j <2}. Define D; = deg(X;) +1 and 

. i? i? . 

M; = maz-coef(X;). Then, D; < 2', M; < po? , and 1; > pe , where y is the 


constant in the statement of Lemma 5.9. 


The basis of the induction is 2 = 0, and the path is uv), which is the root. All the 


claims hold trivially in this case. 


We show how to define [a;41, a;41 + 1:41] and how to choose the vertex v;42 such that 


Properties 1-3 will be satisfied by the interval [a;41, ai41 + li41] and the prefix of P that 
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bounded by 2(D; + 1)M?. Therefore, Dis: < 2Di, Mign < 2(Di + 1)M?. For each 


z € [Gj41, Gia1 + Ui41] and rizi(xz), Properties 1-3 are satisfied. 


Suppose that f,,,, = Lfe,| = |g(x)|, for some 1 < j < 2%. By our hypothesis, 
deg(g) < D; and maz-coef(g) < M;. By Lemma 5.9, there exists an integer J, and 
an interval [aj41, @i41 + Ji41], such that (i) for any z € [ai41,a:41 + Gai], [g(2)] = J, 
JJ] < Mi(pls-P* <p pr BOH™) yo” and (ii) lan > (ul) > 
p(it2? (2441) > perry” | Define rj4,(2) = J. Then, Dj4; = D; and M;,,; = max{M,, J}, 


and properties 1-3 are satisfied. 


Thus, we have proved the existence of a path P and an interval [a,a + /] such that 
properties 1-3 are satisfied for all vertices on P. We have also established that there 
is a rational function r(x) such that for all a € [a,a + /] the value produced by T (at 
the end of the path P) on input a is given by r(@). Furthermore, that deg(r) < 2°, 


M = maz-coef{(r) < ane and | > ie 


By Lemma 5.10 there exists a subinterval [a,a+'], l= T(sGalsed INT = 0(=47), such 
that for any v in the subinterval, |v'/* — r(v)| > e(M(sd + 2))~4(£)*84+5 = 6, where c is 


a constant. This implies that we can lower bound error,(T, W/Z) as follows, 


error,(T, Wax) > él’ 


2 
The value of 6 is 2(M~4(4)°(s)), and we can show 6 > cz’ | where c, is a constant 


that depends only on s. Since T approximates \/z, in L,, within €, then 


gon? 
3 


Therefore, h = Q(,/log log +). O 


€>error,(T, Wz) > él’ >c 


Chapter 6 


Upper Bounds 


In this chapter we show how the power of the floor function can be used to accelerate 
computations. Clearly the floor function can help in computing the floor function and 
the mod function, but the interesting fact is that it can accelerate computations which 


we normally do not associate with the floor function. 


One issue of interest is the constants that the computation tree uses. A uniform com- 
putation tree has as constants only {0,1}, and every other constant has to be generated 
explicitly. On the other hand a non-uniform computation tree may have arbitrary initial 


constants. 


We show that many functions can be computed by non-uniform computation trees 
of depth O(1). This justifies the reason that the lower bounds proved in the previous 
chapters are for the uniform computation trees. For uniform computation trees we show 
upper bounds that, in some cases, match the lower bound that we proved in the previous 


chapters. 


We start, in Section 6.1, by showing that any polynomial has a non-uniform computa- 


tion tree of depth O(1) that evaluates it, and this constant does not depend on the degree 
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First we show how to construct a constant that encodes all the coefficients of a 
polynomial P(y). The constant has simply all the coefficients in order, each padded 
to m bits, where m will be chosen latter. More formally, let P(y) = 4.5 piy’, where 
the coefficients, p;, are integers. Define cf) = P(2™) = yt pi2™. For m, such that 
2™ > mar-coef( P), it is easy to verify that the m-bit integer obtained by considering bit 


positions im to (2 + 1)m — 1 of c® is equal to pj. 


The polynomial evaluation algorithm: 


The algorithm is constructed to evaluate the inputs of a specific polynomial P in constant 
number of operations. The algorithm receives as an input an n-bit integer a. The 
algorithm output is the value of P(a). The polynomial evaluation algorithm consists of 


two steps. 


1-2-™a 


Step 1. Compute 6 = = =|: Letting y(a) = D4, a'2™(4-9), we show that b = (a). 
For |x| < 1, the following identity holds, += = 7%, c'. Recall that a < 2”. We will chose 


m such that 2" < 2”, and therefore 2-"a < 1. This implies that —+.- = D%2)(2-"a)', 
and therefore 
gid md “Ma m(d—# a 
jee 2= mq a => 2 (2 =9 2 a 


7=0 1=0 
For m, such that a?1/2™ < 1/2, the sum &,,, 2™¢-a! is less than one. This, in 


combination with the fact that @ is an integer, implies that 


3 antl = (a). 


t=d+1 


a F 
he bs | = > gm(d-t) 3 + 


1=0 7=0 


The constants used in this step are 2? and 2”, and the requirement on m is that 


m>n(d+1)+1. 


Step 2. Extract the m-bit integer obtained by considering bit positions md to m(d+1)—1 
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6.2 Approximations 


The non-uniform case 


We show how to use the construction in the previous section, to approximate Vb for any 
6 € [1,2] in constant time. The existence of such a program does not contradict the 
lower bounds of Chapter 5, because it assumes that certain large constants are available 
for “free”. In other words, we show a O(1) non-uniform upper bound on the depth of 


computation trees for approximating \/z. 


For 6 € [1,2], let a = |[4b/e?|. It is not difficult to see that if |,/a — a| < 1 then 
|\/b — ae/2| < €. Below, we present an O(1) step algorithm for computing a. 


Our algorithm is based on Newton iteration for computing square roots. Recall that 


Newton iteration for computing the square root of a is given by 


a+2? 
tat = nee where 2g is the starting point. 
t 


Define 6; to be the relative error of z;, i.e. (1 +6;)\/a = 2;. It is easy to verify that 


6? 60.91 
6; = a+ fra) = A=) , assuming that do < 1 
Let ro = |3/e], this implies that /a < ro < 3 Ja. Since 0 < &9 < 1/2, this implies that 
6; < 2-47. Our aim is to reach an index ¢, such that |\/a ~ | < 1. This occurs when 
\6:,/a| < 1, or equivalently |6;| < co For t = O(log loga) = O(log log(1/e)), the value 
of 6; satisfies, 0 < 6; < 1/,/a. Thus, in order to compute a, it is sufficient to perform t 


Newton iterations, starting at zo. 


Notice that upon starting the Newton iterations at ro, the value of z, is obtained 


by evaluating the rational function R,(y) = ae at y = a, where P,(y) = y + 22 and 
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The uniform case 


The general idea in this case is similar to the idea in the non-uniform case. Using the same 


logic as before, for any 6 € [1,2] and a = [4b/e?|, if |a —a| <1 then |Vb — ae/2| <«. 


The main difference between the construction in the uniform case and the non-uniform 
case is the following. In the uniform case we will consider the approximation, z;, com- 
puted at the 7“ Newton iteration, as a function of the starting point zo, while assuming 
that a is constant. Recall that for the non-uniform case the roles were reversed; z; was 


viewed as a function of a, while zo was fixed. 


For a starting point zo, the value of the 7" approximation, z;, is given by evaluating 
a rational function H;(x) = as at the point z = zo, where Fi(z) = z* +a, Gi(z) = 22; 
and for 7 > 1, F(z) = aG?_,+ F?,, Gi(x) = 2Fi_1(x)Gi_1(z). (Note that the coefficients 


of F; and G; are integers.) 


The above identities can be used to define a straight line program, of length O(t) that 
computes H;. The rational expression H; performs t Newton iteration for a certain input 
a. This implies that H;(H;(zo)) can be viewed as first performing ¢ iterations, starting 
at Zo and ending at z; = H;(zo), and then performing t more iterations, starting at 2; 


and ending at rz, = H;(z,); and therefore, H;(H;(z)) = H(z). 


For the sake of simplicity we assume that € is a power of two!. Let d; = deg(H;), 
M, = maz-coef{ H,), and 2” = 5. Initially, maz-coef( F,), maz-coef(Gi) < 5 = Mo. From 
the definition of F; and G; it is clear that deg(F;),deg(G;) < 2'. One can verify that 
maz-coef{ F;), maz-coef(G;) < (Mo +1)". Therefore, d; = 2 = d and M; < (Mo +1)" < 
(2) =M. 


We choose m, such that 2" = 2(18)?"*" > 2M2?"4_ Clearly 2" and 2”4 can be 


computed in O(t) steps. Since H; has a straight line program of length O(t), and we 


'For an arbitrary € we can use the powers of two subroutine, describe in the next section, to find an 


e’ < € which is a power of two. The running time of this procedure would be O(,/log log +). 
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input: bye 
computation: 
Compute 2, 2?” and cH, = H,(2™). 


on fab 
fo = |8| 


FOR i = ltok DO 
£; = |H,(xj{_1)|/* invoke the polynomial evaluation procedure */ 


Figure 6-2: Approximating the square root uniformly 


would be ae/2. To conclude we proved the following theorem. 


Theorem 6.3 There exists a computation tree T(€,x), whose depth is O(,/log log ), 
such that for any b € [1, 2], 
[T(€,6) — VB| <e 


Note that the algorithm receives two inputs, € and 6, and outputs T(e, 6). 


Approximating the square root of an integer is closely related to deciding if an integer 
is a perfect square (i.e. whether an integer has an integral square root). We could decide 
if an n bit integer is a perfect square by first approximating its square root within one, 
and then checking the integers within one of that value. Note that in the approximation 
procedure above, a approximates \/a within one, where a is an integer. The problem 
in extending the result to deciding perfect squares is that we need to compute a “good” 
initial point, (i.e. such that 0 < d9 < 1/2) and a bound on Mp. For the approximation 
case, we chose the value of a such that it would be easy to compute a “good” initial 
point for it. The following lemma states that given a “good” initial point, and a constant 


2' > a, we can decide the perfect square problem in O(./log log a). 


Lemma 6.4 There exists a computation tree T(x, 20, 2'), whose depth is O(./log log z), 
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computing 6; = a* mod (a — b;-1), we can raise 6 to the 7k power in O(2) steps, as long 


as bY < a — -DF, 


For a given an integer a, we show how to compute 22" for all 2, such that go" <a. 
The computation is done in i steps. Starting with a; = (2a)? and b, = 2? we compute: 
a; = a}_, and 6; = a; mod (2a — },_,). (Recall that the floor and the mod operations are 
computationally equivalent, i.e. mod can be computed in O(1) operations using the floor 


operation.) 
We prove the correctness of the computation by induction. It is easy to see that 


a; = (2a)?"", Recall that x — y divides x* — y*, for any positive integer k, therefore 


2a — 6;_, divides (2a)* — ae for k = 27-1, Thus, 


b; = a;mod (2a = bi-1) 


gz 


(2a)?"" — ((2a)*"* — 7") mod (2a — b;-1) 


b2' mod (2a — b,-1). 


Observe that 627" = 22" We get that if 627" < 2a — bj, then b = QF By the 
i-1)? ; i i 
induction hypothesis b;_; = 2°-” <a. Thus, if 2? vie a, then 2? eas b;-1. Clearly 


the computation is done in O(7) steps. 


Seemingly, the above procedure gives us a way to compute 92” for i = | ees 
| Vlog log a| . However, note that | Vlog log a| is not known in advance. Thus, we are left 
with the following problem: How can we identify when to stop? That is, how to find the 
first i for which 22" > a and thus b; # 22" We cannot spend the time testing whether 


21-1 * . . . . . 
b; = 6?7/, by successive squaring, for each i, since it requires too many steps. 


The solution is to compute another variable d;, such that d; = ga" ort liegt +i, 
where / is the greatest integer such that b; = 92” This implies that d; = 6; for i < / and 


di41 # bi41. This allows us to detect the termination by simply comparing 5; to d; and 
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Applications 


We can now show a few applications of the powers of two subroutine. The first application 


is computing | Vlog log al. Let ¢ be the last iteration in which 5; = d;. Then, 


t—1 if b> a 
| viog log a| = 


t if 6: <a 


Thus, we can compute | Vlog log a| in O(./logn) time, for all n-bit integers. 


We extend the result of computing | Vlog log z| and show how to compute [log log z|. 
t2 

Using the powers of two procedure we can compute in O(t) steps, the number 2? , such 
1? 2 Y ; : ys ; 

that 277 <a < 2%”. The idea is that in O(t) additional steps, we can find a j such 


that 2?” <a <2?"", 


This is done by simply squaring 92" successively, until the first time it is larger than 
a. Note that since (t + 1)? — t? = 2¢—1, at most 2¢ — 1 successive squaring will be 
performed. We conclude that given an n-bit input a we can compute j = |logloga| 
in O( /log log a) = O(./logn) time. In Chapter 3 we proved an 0(,/logn) time lower 
bound for the respective decision problem. Therefore we have proved the following tight 


bounds. 


Theorem 6.5 There is a decision tree with OP = {+,-,*,/,|-|} and constants 
{0,1} that computes |logloga], for all n-bit integers a, and has depth O(.\/logn). Fur- 
thermore, any decision tree for this problem has depth 0(./log n). 


We can also use the powers of two subroutine to show that for infinitely many n, there 
is a decision tree, of depth O( /logn), that decides if an n-bit integer is a perfect square. 
In this case we restrict the integer input to be from the interval [2"~', 2”). Suppose that 
n = 241, In O(,/logn) time we can compute g2 2/2 and sates, 2”, starting 


from the constants zero and one, and the input. Using Lemma 6.4, with the initial point 


Chapter 7 


Conclusions 


In this Chapter we give a brief overview of the results that were presented in this thesis. 


We also mention a few open problems and possible directions for future research. 


In Chapter 3 we developed a lower bound technique for decision tree with operations 
{+,—,*,/,|{:|}, This lower bound technique could be used to prove 2(/logn) lower 
bounds for problems, such as deciding perfect squares, computing {log log x|, and other 


problems. 


As a historical remark, it is interesting that when the lower bounds were proven, there 
was a gap between the lower bound of 2(./logn) and the upper bounds of O(logn). At 
that time it was believed that the gap could be closed by improving the lower bounds. 
Only later, when considering closely the lower bound argument to see where it could be 
tightened, did the technique of achieving the upper bound emerge. This, and the other 
upper bounds in Chapter 6, show that the floor operation can add significant power to 


the computation. 


In Chapter 4 we have proved an 2(log log n) lower bound on the depth of any compu- 


tation tree with operations from the set {+,—,*,/,|-|}, that decides if two n-bit integers 
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are relatively prime. An upper bound for this problem can be obtained from the parallel 
algorithms of [CG90, KMR84]. Analyzing those algorithms, with respect to the number 


of arithmetic operations, reveals an upper bound of O(n/logn) for computing the gcd. 


Notice that the gcd can be written as an Integer Linear Program. Therefore, one 
of the consequences of our results is that there is no algorithm for the Integer Linear 
Programming problem, using operations only from the set {+, —, *,/, |-| }, whose running 
time depends only on the number of variables and the number of constraints, and not on 


the size of the coefficients. 


In Chapter 5 we prove lower bounds for approximating the s“ root of a real number. 
We show that approximating the s“ root, such that the average error is less than e¢, 
requires at least 9(,/log log +) operations. On the other hand, in Chapter 6, we prove 
an O(,/log log #) upper bound, when the computation receives both the number to ap- 
proximate and the accuracy parameter e. This procedure approximates within ¢ in the 
worst case (i.e. for each input the approximated value is within € of the correct value). 
Bashouty [Bas90b] has shown an (0(log log +) lower bound for worst case approximation. 
It still remains as an open problem the complexity of the average approximation for the 


s* root. 


We know that if the intermediate results in the computation can be exponential in the 
size of the input then any PSPACE computation can be done in a polynomial number of 
operations (see [BMS81]). On the other hand, if the intermediate results are restricted 
to be polynomial in the size of the inputs, then any computation with a polynomial 
number of operations is in polynomial time. One very interesting direction of research 
would be to show tradeoffs between the size of the intermediate results and the number 


of operations required. 
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input: a 
initialization: 6, = 27, a; = (2a)?, cy = (2a)4, dy = 27, and i= 1. 
computation: 

WHILE 6; = d; DO 


b; = a; mod (2a — 5;_1) 
d; = c; mod (a; — dj_;) 


END 
2 
output d; [Pde oe *] 


Figure 6-3: Computing powers of two 


terminating when they are not equal. It remains to show how to construct d;. 


In addition to the computation of a; and b;, we compute c; and d;, where: c, = (2a)*, 
d, = 2?, c; = c}®, and d; = ¢; mod (a;—d,_,). It is easy to see that c; = (2a)2"°. Notice 


g2i-1 2(i-1)? : “th: . 
that c; = af. Suppose that b;_, = d;_; = 2 , and consider the 7" iteration: 


d; = d?"~" mod (a; — d;-1). 
This implies that if d?";" < a; —d;-, then d; = 2”. Clearly, if d?"7" < 2a — bj, and 
bj-1 = dj-1, then d?""* < a; —d;-y. We conclude that for all i such that 2?” <2a—bj-1, 
b; = d; = 2?" . Consider the least i such that 2” > 2a—6;_,. Clearly, for this i, 6; # 22". 


)2 


i2 os A . fey 
However, we claim that d; is still equal 27). This is true since Der eg ee b;-1, 


implying that 


i2 i-1)? \ 921— i- i- i- 
OP ee Oa bal Gay Sb, Sue a: 
Thus, the first time 6; 4 d;, we stop the computation. Let ¢ be the last index such 


that 6, = d;, this guarantees that b; = got and a < 224” (Figure 6-3 gives a full 


description of our algorithm for computing powers.) 
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such that if /z < 20 < 3 z and 2' > a, then 


|P(x, 20,2') — /2| < 1. 


Proof: Let n = l, d; = deg(H;) and M; = mar-coef( H;). We show how to compute 
2” > 2M,2?"* in O(t) steps. Once we show this, the rest of the algorithm and the proof 
is identical to the approximation case. Recall that M; < (Mo+ 1)?"", Since x < 2!, 
then My < 2!', which implies that M, < (2'+1)?" = M. Since d; = 2¢ and 2" < 2, 
we can compute 2” = 2(2')?"*? > 247274" in O(t) steps. For t = O(/loglogz), the 
above computation time is O( /loglog z). Therefore the total running time of the entire 


procedure is O(/log log z). oO 


Later we show how to use the above lemma to prove that for infinitely many n, for 


any n bit integer, we can compute a “good” initial point in O(./log7n) steps. 


6.3. Computing powers of two 


i? : nip : 
Suppose that we want to compute 2?’ , for some 2. Intuitively, it seems that the fastest 
way to do it is by successive squaring. This gives an O(2?) step procedure for computing 
this number. Surprisingly, using the floor operation, and given a large enough number, 


we can compute this power in O(z) steps. 


The main idea is based on the following observation. Let a, 6 and k be positive 


integers, and a > b. Then, 
a* mod (a — b) = ba*“! mod (a — b) = * mod (a — 8). 


If b* < a — 6b then the value of the above computation is simply b*. That suggests that 


once we raised a to the k** power, we can raise b to the k power in O(1) operations. By 
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already computed 2”, we can compute cy, = A,(2™) in O(t) steps. To summarize, so 
far we have computed 2”, 24 and c%}, in O(t) steps. Thus, we have computed all the 
constants necessary for Theorem 6.1, and therefore an evaluation of H; can be performed 


in O(1) operations. 


The computation evaluates H, k times; the input to the 7“ evaluation of H;, 2;, is 
the output of the (i-1)** computation, i.e. 2; = H;(z;-,). Each evaluation of H; costs 
only O(1), therefore the length of the program is O(t + k), and it performs tk Newton 
iterations. This implies that it can perform O(log log +) iterations in O(log log 4) steps 
(by choosing t = k). 


However, we are still left with a slight implementation problem. Notice that z,, for 
t > 0, may be non-integer, while the polynomial evaluation procedure is valid only if x; 
is an integer. (Another condition is that 2; < 2", but this is always true.) To fix this 
problem, suppose that in each iteration instead of computing z;, we compute the smallest 


integer greater than z,. Formally, define the modified iteration as follows. 
‘ F(4i) 


We claim that this does not change the convergence rate by more than a constant factor. 


Let 2; = (1+ 4;),/a. It can be verified that, 


; ae | 
[dita] S (5) + Te 


As long as b; > an this implies that 2( bist)?" > Tn and therefore, 


5x4 2¢ agit 
oy < 8 


On the other hand, if 0 < 4; < Fa then /a < 2; < /a+2. This implies that 


| /a — (2; — 1)| < 1. Therefore, we can set a = #; — 1, and the approximation to Vb 
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input: 
computation: 


Figure 6-1: Approximating the square root non-uniformly 


Qi(y) = 220. In general, z; is obtained by evaluating a rational function 


P,-1(y) 
R,(y) = Py) = yt mn = yQi-s(y) + PAi(y) 
(y) oF 2Pi-1(y)Qi-1(y) 


at the point y = a. This implies that P;(y) = yQ?.i(y) + P2.1(y) and Qi(y) = 2Pi-1(y) 
Qi-1(y), forz > 1. (Note that the coefficients of P; and Q; are integers.) We are interested 
in computing a = R,(a) = P,(a)/Q:(a). Note that given xo and t the polynomials P; and 
Q; are defined. Furthermore, zo depends only on e, and not on a. Since, the coefficients 
of P; and Q; are integers, by Theorem 6.1, we can compute P,(a) and Q;(a) in constant 


time, for any n-bit integer a. Hence, we can compute a = R,(a) in constant time. 


Let 8 = ae/2. Since a can be computed in O(1) steps, @ can be computed in O(1) 
steps. As we discussed before, |b — 8| < «€, therefore we have shown the following 


theorem. 


Theorem 6.2 There exists a constant C such that, for any 0 < € < 1 there is a 


straight line program L, of length C, such that for any b € [1,2], 


|Vb— L.(b)| <€ 
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of y(a)cy. More formally, compute 
B= ly(ayera-m4| — ly(a)ema-™e-™ | 7 lee 


We claim that P(a) = 8. Consider the product y(a)cf, then 


d d 
alg = Lo yeneiantos 


7=0 7=0 


d  min{d,d+k} 


2 ot eee 


k=—-d i=max{0,k} 


d 
a > quant?) 
k=-d 


Note that P(a) = 2.) pia’ = go. For m, such that 2" > max;{gi}, the m-bit integer 
obtained by considering bit positions md to m(d+ 1) — 1 of y(a)cy is go = 8. This 
concludes the description of the polynomial evaluation algorithm. Below we bound the 


value of m that meets the above requirements. 


Let, maz-coef(P) = M. Since a < 2", then the value of a‘ is bounded by 2™. Thus, 
each g, is bounded by dM2"4, In step 1 we required that m > n(d+1)+1, therefore it 
is sufficient to chose m > 2nd + [log M] +1. (Note that this implies that the size of the 


constants used are polynomial in d, n and log M.) 


Recall that a straight line program is a computation tree without any comparison 


nodes. Thus, we have shown the following theorem. 


Theorem 6.1 There exists a fixed constant C such that, for any polynomial P with 
integer coefficients, there is a straight line program L of length C, that computes P(r) 
for all n-bit positive integers. If deg(P) = d and maz-coef(P) = M, then the constants 
used in L are 2™, 2™+, and oS for any m > 2nd + flog M] +1. 
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of the polynomial. In section 6.2 we apply the techniques developed for evaluation of 
polynomials to the problem of approximation of square roots. For the non-uniform case, 
we show a straight line program that approximates the square root in O(1) steps. For the 
uniform case, we show a computation tree T that, on input € > 0 and z € [1, 2] computes 
T(e€,z) such that |,/z — T(e,x)| < €, and the has length O(/log log +). In contrast, it 
is known that using only rational operation, O(loglog +) steps are both necessary and 


sufficient (see [Bra86]). 


Note that when we are considering approximation of square roots, we are considering 
two classes of algorithms. The first class is algorithms that receive only the input, z, and 
although they have a guarantee that the output is at most ¢ from the square root, they 
do not use the value of € in the computation. An example of such an algorithm is the 
Newton iteration method; the lower bound that we show is for this class of algorithms. 
The second class, is an algorithm that has two inputs, the point z, and the desired 
approximation e. Unlike the previous type, here the algorithm uses the value of « during 


the computation. The upper bound belongs to the second class of algorithms. 


Section 6.3 shows that the floor function can be used to accelerate exponentiation. 
Suppose we would like to generate the number go Using repeated squaring this can be 
done in O(i?) steps. We show that this number can be generated in O(7) steps, given 
any integer z > 92" (and using the floor function). Using this procedure we show that 


part of the lower bounds proved in Chapter 3 are tight. 


6.1 Computing polynomials 


We consider the complexity of evaluating a polynomial, whose coefficients are integers, 
on an n-bit integer input. We show that this problem can be solved in constant time, 


using arbitrary constants and the floor function. 
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starts at v,, and ends at vj42. We choose [aj;41, ai41 + i41] as a subinterval of [a;, a; + |;]. 
Therefore, by the induction hypothesis we have that Properties 1-3 are satisfied by the 
interval [a;41, a:41+1;41] and the prefix of P that starts at v, and ends at v,41. In order to 
complete the proof of the lemma we need to show that (a) there exists an outgoing edge of 
vig such that for any input z € [ai41, 4:41 + U:41] the computation follows this edge, and 


(b) Properties 2-3 are satisfied also for the vertex vj,, and the interval [aj41, ai41 + igi]. 


By the definition of the tree T’, the vertex v,4; is either a comparison vertex or a 
computation vertex. If it is a comparison vertex, then a comparison g(x) < A(z) is 
performed. If it is a computation vertex, then either fy,,, = rigi(z) = g(x) 0 h(z) 
foro € {+,—,*,/}, or fu4, = |g(z)] is evaluated. Here, g(x), h(x) € {0,1} U {f,, |v; 


is a computation vertex, j <7}. 


By the induction hypothesis, g(x) and A(z) are rational functions of degree less than 


D; and maz-coef(g), maz-coef(h) < M;. 


The proof is based on a case by case analysis. In each case, we define the next vertex 
vita On the path P, the interval [a;41,a;41 + li4:], and the rational function ri41(z) 


(whenever v;41 a computation vertex). 


First, we resolve the case when v;41 is a comparison vertex. The comparison is of the 
form g(z) < h(x). By the induction hypothesis, both A(z) and g(x) can be represented 
as polynomials of degree less than D,;. Consider the rational function g(x) — h(x), which 
is of degree less than 2D;. Use Lemma 5.2 and choose a subinterval [a;41, ai41 + a 
where g(x) — h(x) does not have any poles or zeros. Set l;41 = vay: It is now easy to 


check that properties 1-3 are satisfied in this case. 


Next, consider the case when v,4, is a computation vertex. The following possibilities 


may arise. 


Suppose that o € {+,—,%,/}. Let riyi(z) = g(x) 0 A(z), aig, = a;, and [41 = fj. 


By Lemma 2.1 the degree of r;4:(z) is less than 2D;, and its maximum coefficient is 
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maz-coef(r) < M, there exists a subinterval [a,a+ Tay such that for any v in the 


subinterval, {v/s — r(v)| > (2(2M)*)-1( sh )lod+0) 


Proof: Let z* = v, such that Wa <2 < Wat 4 < Wa +1 Consider the rational function 
E(x) = a—-r(z*). Clearly, the degree of F(z) is bounded by sd+1 and maz-coef(E) < 2M. 


By Corollary 5.8, there exists a subinterval [8,8+ A] C [y/a, a+ 4], such that 


A= agaaqa? and for any x € [8,6 + A}, |E(x)| > (2(2M)*)-1(<6 64+), 


Let a = 8° and the subinterval be [a,a + sA/2]. For any v € [a,a + sA/2], then 
xz €[8,8+A]. This implies that on the subinterval [a,a+s\/2], then difference between 


v/s and r(v) is at least (2(2M)?)-*( Ly 6440), ci 


5.5 Lower bound for approximation 


In this section we use the technique that we developed in the previous section to establish 
the lower bound on approximating y/z. We show an (,/log log 4) lower bound on the 
depth of any computation tree T, with OP = {+,—,x,/,|-|} and constants {0,1}, that 


€-approximates \/z, where s is any fixed constant, for x € [1,2], in norm LZ. 


Theorem 5.11 For any integer s > 2, if a computation tree T, with operations 
OP = {+,-,*,/,|-|} and constants {0,1}, approzimates W/z, for a fired s and x € [1,2], 
such that 

error,(T(x), /z) <e 


then T has depth h = 2(,/log log +). 


Proof: The bulk of the proof involves constructing (i) a path P from the root of T to 


one of its leaves, and (ii) a subinterval {a, a+ (] of [1,2], with the following properties: 


1. On input a € [a,a+ 1] to T, the computation follows the path P; and 
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Note that q(z) and p(z) have the same degree and, 


(5) max {la()1} = max {Ip(2)1) 


ze[-1,1] "ela nal 


By Lemma 5.5, there exists z € [—1,1] such that |q(zo)| > 2-?+1. Therefore, K > 
2-4+1/5| (£)4. (Recall that |b] > 1.) Hence K/2 > (4)?, which completes the proof. 0 


The following lemma combines the above lemma, which enables us to give lower 
bounds for the value of a polynomial, with the first lemma, that observed how to upper 


bound a polynomial, to establish a technique to upper bound a rational expression. 


Lemma 5.7 Given an interval [a,a+ 1], where] <a < 2 and0 <1 <2-—a, and 
a rational function r(x) of degree d with integer coefficients and maz-coef(r) < M; there 
exists a subinterval [a,a + <4;], such that (i) r(x) is monotone in this subinterval, and 


(ii) |r(z)| < 2M(2)? for all x € [a,a + sy]. 


Proof: Let r(z) = es where p(z) and q(z) are polynomials of degree at most d with 
integer coefficients. By Lemma 5.1, the maximum value of |p(x)| for x € [1, 2] is bounded 
from above by M2¢+!, By Lemma 5.2, there is a subinterval [3,2 + +] in which r(z) is 
monotone. By Lemma 5.6, there is a subinterval of [6,8 + 4], denoted by [a,a+ ae 
such that |q(x)| > (4)? for all c € [a,a + <3]. Therefore, |r(x)| < 2M(?)? for all 
x € [a,at sal: Oo 

Let ri(xz) be a rational expression and r2(r) = eR Since both the degree and 
the maximum coefficient of r; and rz are identical, a lower bound for the value of r; 


would imply an upper bound for the value of r2. The following corollary formalizes this 


observation. 


Corollary 5.8 Given an interval [a,a+/], where 1 <a<2 and0 <I <2 -<a, and 


a rational function r(x) of degree d with integer coefficients and maz-coef(r) < M; there 
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We need to use a slight modification of Markoff inequality. The segments that we are 
interested in are not necessarily [—1,+1]. Therefore the following formulation would be 


more convenient. 


Lemma 5.4 Let p(z) be a polynomial of degree d, and p'(x) the derivative of p(x). 
Then 
oe 
max tle} S 4- max {lp(@)[}- 
Proof: Consider the substitution « = z4 + 24t!. If z € [—1,1] then z € [a,a+ J]. Let 
q(z) = p(zz + 74H). By Lemma 5.3 


x {lq'(z)I} < @ apax {la(2)It- 


zl a ry -1,1) 


: dqgdz _ dp dz _ 2 
Since 2S = &, and = = 7, the lemma follows. a 


The importance of this lemma is that using it we can claim, that if over the interval 
{a,a + /] the maximum value of a polynomial is p(x’), then there is a neighborhood of 


‘ 


z’, in which the value of the polynomial is Q(p(z’)) and the size of the neighborhood is 
Q(l/d?). 


The family of polynomials, known as Chebyshev polynomials, have many application 
in Numerical Analysis. We use here only one aspect of the Chebychev polynomials, and 


that is their ability to give a lower bound for the value of a polynomial. 


Lemma 5.5 (Chebyshev [Riv69, page 31]) Let p(x) be a polynomial of degree d > 1 
defined over [—1,1]. If the leading coefficient of p(x) is one, then there exists zo € [—1, 1] 


such that |p(xo)| > sasr- 


The importance of the above lemma is to enable us to establish a lower bound on 


the value of a polynomial. Once we are able to show a lower bound and upper bounds 
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proving a lower bound for rational expression, over [a,a + ]. 


The last part of the proof involves proving lower bounds for rational functions. We 
show that for a rational expression r(x), defined over [a,a + J], there is a subinterval 
(a, + A], such that for any input x € [a,a + A], the difference between r(z) and \/z is 
at least 6. This implies that 


IIr(z) — Wahi 2 5d. 


5.4 Lower bound technique 


We would like to establish a technique to bound the value of a rational function over an 
interval. In general, a rational function may be unbounded; therefore we are seeking a 
“large” subinterval in which the rational function is bounded. The techniques that we 
develop show how to both upper bound and lower bound the value of a polynomial. The 
first lemma shows a very trivial upper bound on the value of a polynomial defined over 
[1,2]. Much of the remainder of the section is devoted to showing that given a polynomial, 


there is a subinterval in which we can lower bound the value of the polynomial. 


Lemma 5.1 Let p(x) be a polynomial of degree d and maz-coef(p) < M. Then, for 
all x € (1, 2], |p(x)| < M271, 


Proof: Let p(x) = 4.) a2. Then clearly, 


d : d ; d ; 
|p(x)| = | So a;2"| < ye la;|z’ < >> M2 < M2¢*! 
i=0 t=0 1=0 
Oo 


It would be conve: -at that the rational functions that we are dealing with would 


not have any poles (i.e. inputs for which they go to infinity) or any roots. The following 
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Recently, Bashouty ([Bas90b] has shown that approximating the square root of two, 
requires 92(log log +) operations, when the only input is z, and ¢€ is not an input. This 
implies a lower bound in norm L,,. It is still an open problem what is the bound for 


norm L,, where the only input is the number to be approximated, and not e. 


5.3. Overview 


In this section we give a general overview of the proof that we show in the next two 
sections. At a very high level, we establish the proof along the following lines. First, we 
show that there is a “large” subinterval, such that all the inputs from this subinterval 
follow the same path. Second, the output that the computation tree computes for this 
subinterval can be expressed as a rational function. Third, any rational function, of a 
given degree and coefficient size, would have to be “far” from the s™ root function, on 
a “substantial” part of the subinterval. Clearly we would have a tradeoff between how 
“far” the functions are, and how “substantial” is the part of the subinterval on which 
they are “far”. Combining those two parameters translates to a lower bound on the 


approximation in norm Lj. 


Now we elaborate slightly more on each component of the proof. We start with 
the main tool that enables us to show how to handle the floor function. Consider the 
expression |r(xz)|, where r is a rational expression, defined over [a,a + /]. We show that 
there is a sub-interval, [a’, a’ + l'], such that the value of [r(x)| is constant in it and the 
ratio between / and /’ is bounded as a function of deg(r) and maz-coef(r). The smaller the 
ratio between /’ and /, the better lower bound we can prove at the end. The proof of this 
claim quite involved, and uses theorems from Approximation Theory, such as Chebyshev 


polynomials and the Markoff inequality. 


Given a computation tree T that receives an input xz € [1,2], we show that there 


exists an interval, [a,a+ [] C [1,2], such that any input z € [a,a+ /] follows the same 


58 . CHAPTER 5. APPROXIMATION OF REAL FUNCTIONS 


example, that computing the average distance of n points in the plane requires exactly 


(3) operations. (For average distance, [SY76] showed that it can be approximated with 


n 


,) operations. ) 


substantially less than ( 


This work pursues the second direction. Rather than computing a function exactly, 
we are interested in approximating the function’s value. The set of basic operations 


does not change and remains the set of rational operations and the floor function, i.e. 
{+, yay /; LJ } 


This chapter is organized as follows. In the following section we define formally the 
meaning of an approximation. In section 5.3 we give a general overview of the lower 
bound technique that we develop. In section 5.4 we develop the lower bound technique, 
that we later use in Section 5.5 to prove the lower bound for approximating the s‘ root 


of a real number. 


5.2 Definitions 


The main aim of this section is to define formally the notion of approximation. We start 
by a very brief background, and define inner product of real functions, and a norm of a 
real function. The inner product of two real functions f and g, over a region A, is defined 


as follows, 


<f.g>= | fla)g(z)d2 


Given the definition of the inner product, We define the norm Ly, of a function f, 


over a region A, to be 


Ile = 4) f Lee) Fda 
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tree are input vertices of the inputs a and 8. If (u,v) is the generator of (a,b), then, by 
Lemma 4.6, @ and 6} are polynomials in u and v of maximum coefficient and degree less 
than 2" and n/5, respectively. This implies that the number of monomials in each 
of these polynomials is at most n?/5 and that the value of each monomial is at most 
anil? g2ntl® antl Therefore, a, b < n2/8gn'/* 2n7/Sant/® _ 72/594n*/° < On for large enough 


n. 


Lemma 4.7 asserts that some pairs in the set (u,v) € S(0,a1,a@2,a3)N {(u,v) :1< 
u,v < 274(t+1)} are relatively prime, and some are not. The Correspondence Property, 
Lemma 4.1, guarantees that the gcd of the inputs is the same as the gcd of the generators. 
Since the leaf v is labeled by a constant, the inputs that reach this leaf are either all 
relatively prime, or all not relatively prime. Since all the pairs reach the same leaf, we 


reach a contradiction. Oo 
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(u,v) is the generator of (x,y). By Lemma 4.4, the degree of ae fo) OES is less than 
g24it1) 


2D; < pee and its maximum coefficient is bounded by 2M?(D; + 1)? < 2 


Suppose v;41 is |y|, where v is a previous computation vertex. By the induction 
hypothesis, val(v) = ees, where (u,v) is the generator of (z, y). By Lemma 4.5, there is 
a rational expression Q(z, y), and a subset of the inputs S“+"), such that val(v) = Q(u, v), 
where (u,v) is the generator of (x,y). The degree of Q is at most Dj4; < 4D! < aa 


4(i+1) 
and the maximum coefficient of Q is at most Mi4, < 2(4.D;M;)2” < 2? 


We are not done yet with this case. The substitution of uj by Aiz1(uigi)*+? + uiza, 
affects not only the rational expression Q, which is stated in Lemma 4.5. It also changes 
all the polynomials in ;. Since 6:41; < D;+1, the degree of the rational expressions in 4; 
is at most D4, < D,;(D;+1), and the maximum coefficient is at most Miz1 < Ni 2?! Mj. 
Note that the degree and maximum coefficients of previous vertices change, but they are 


bounded by the current D; and M;. O 


4.4 GCD Lower bound 


In this section we show how to apply the proof technique that we developed in the 
previous section to derive a lower bound for computing the greatest common divisor of 


two integers. 


We start by showing that in any set of inputs S(-), there is one input that is relatively 


prime, and another one that is not relatively prime. 
Lemma 4.7 Let a;,a2,a3, and t be positive integers such that a, < t, and a2,a3 < 
2'. Then the set S(0,a1,a2,a3)N {(u,v) : 0 < u,v < 2%} contains two pairs (ao, a1) 


and (bo, b,), such that gcd(ao, a1) #1 and gcd(bo, b,) = 1. 


Proof: Let e be the least positive integer exponent such that (1 + a3)*° > a2. Define 
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Lemma 4.6 Let T be a computation tree of depth h, with two inputs, the operations 
{+,—,*,/,|-|} and constants {0,1}. Then, there is a path P from the root of T to a 


leaf, and a subset S of inputs, with the the following properties: 


1S = S(r,a,a2,a3,A,A)N {(a,b) : 0 < a,b < 2"}, for some positive inte- 
GETS T, C1, A2, 3, 61, 62,..., 57, Ary A2,+--;Ar, where A = (6), 62,...,6,-), and A = 
(Ai, A2,---,Ar); 


2. For each input (a,b) € S. the computation follows the path P; 


3. For each computation or input verter v on the path P, there is a pair of bivaria: 
polynomials (F,(z,y),G.(z,y)) with integer coefficients, such that for each in; . 
(a,b) € S, G(u,v) £0, andval(v) = ate, where (u,v) is the <r, a1, Q2,03, A, A>- 
generator of (a,b); i.e., the value computed at v on input (a,b) € S, is the value of 


the rational expression eeu at (u,v); and 


4. Let % = {F,(2,y),Gi(z,y) | v € P}. Define D and M to be the degree and the 
mazimum coefficient of ©, respectively. Then, r < h, max{a,,D} < 22" and 


max{a2,a3, M} < g27" 


At first sight it might be surprising that the above lemma does not claim explicitly 
any bound on the values in A and A. The reason is that we can add the input as two 
input vertices in the tree 7’, and therefore, we express the input as a rational expression 


of the “generators”. This guarantees the connection of the values in © with A and A. 


Proof: We denote the vertices on the path P by v,v2,...,u1, in that order, where v1 
is the root of the tree 7, v; is a child of v;_;, and y is a leaf of the tree T. The path 
P and the set S are defined inductively, starting with the path v,,v2,v3 and the set 
S() = $(0,1,0,1) (which consists of all pairs (a,6), where a > b > 0). Following that 
proof, suppose that (a) we have selected a prefix of P, which starts at v,;, and ends at 
a vertex v;41, and (b) constructed the set S@ = S(r®, al af, af), AM, AM) with the 


following properties: 
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and, 


a’, = max{de,72(R), m2(L4P2— RY} < MP*) + ((D+2)M)?*? 41 


lA 


SDM 


This guarantees that for any (u,v) € S(0,a}, aj, a3),0< R(u, v) and R(u, v) < 
L? P(u,v). This implies that 


R(u,v) 


soe 
Tapas 


We conclude that for (z,y) € S(0, a}, a4, 7a3), 


ie d 
[P(2,y)) = Aw 1s A (2,9) 


Clearly, deg(Q) = deg(A) < D, and maz-coef(Q) < maz-coef(A) + 2M? < 2((2+ 
D)M)P*}, 


In Case 5 we show how to reduce the general case to Case 4. In order to summarize the 
effect of Case 4, observe that max{a}, deg(Q)} < (D+1)*, and max{a}, ta3, maz-coef(Q)} 
< 2((D + 2)M)?*?. 


Case 5: The leading monomial of P,(z,y) is La4y!. 


Our goal is to reduce this case 
to Case 4 where no powers of y appear in the leading monomial. We introduce a new 
variable z and substitute z using it. Let 6 = max{ay, deg(/P,) +1} < D+1 and 4 = a3. 
we substitute z by Ay’ +z and consider the polynomial Q(y, z) = P2(Ay®+2,y). Observe 
that the leading monomial in Q(y, z) is a constant times a power of y, i.e., no power of z 


appears in the leading monomial of Oly, z). Thus, we have reduced this case to Case 4. 
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As in Case 3, let A(1,1) = cL? + y, where c is an integer and y is a non-negative 


integer such that 0 < y < L?. 


Pi(u,v) — yPa(u,v) + R(u, v) ep A(u,v) — ¥ 
Py(u,v) L4P,(u, v) [4 


Let t = L4. Then, for each (u,v) € S(0, 41, 42,703), A(u,v) = y (mod L*). Hence, 
for each such pair (u,v), the expression a evaluates to an integer. Therefore, 


Ros = ao + od rf A(u,v)—¥ 
Pa(u,v)| L4 P2(u,v) L4 ; 


Our aim is to restrict the inputs such that the value of | PBteple Rte) | is fixed. We 


distinguish between two subcases, according to the value of 7. 


Subcase 4.1: y > 0. In this subcase we restrict the input, such that the value 
of [eee Rt) | is zero. Consider the polynomial V(z,y) = L7?(y P2(z,y) 
+R(z,y)). Since deg,(R) < deg,(P;) the leading coefficient of V is 7yL. Let 
B(x, y) = Pa(z,y) — V(x,y). The leading coefficient of B(x,y) is (1 — 7y)L. Since 
0< 4 <1, the leading coefficient of B(x, y) is positive, i.e. sign(B) = +1. Recall 


that = L*. Let, 


a = max{@1,77(B),m(yP2+R)} < D(D+1)4+1 


IA 


(D +1)’, 
and, 


a = max{@2,72(B),72(yP2+R)} < M?++((D+2)M)PH? +1 


IA 


2((D+2)M)?*?. 
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3. Po(z,y) is a constant, i.e., d, = dy = 0. 
4. 1=0, i.e. y does not appear in the leading monomial of P2(z, y). 


5. | > 0, i.e. the general case. 


Case 1: Pi(x,y) = P(v,y), for (e,y) € S(0,a1, 02,09) Let Q(2,y) = 1/1, m= 1, 


jo Noms 
a, = a; and a) = ag. 


Case 2: P;(z,y) ~ Po(z,y). Let B(z,y) = Po(z,y) — Pi(z,y). Since the leading coeffi- 
cient of P2 is positive and greater than the leading coefficient of P;, then sign(B) = +1. 
Using Lemma 2.3 let a = max{d1, 7i(B)} and a = max{d, 72(B)}. Lemma 2.3 guar- 
antees that for each (u,v) € S(0,a{,a4,1), B(u,v) > 0. Observe that B(u,v) > 0 implies 
that P2(u,v) > Py(u,v). Since aj > m(P2) and a4 > m2(P2), then P2(u,v) > 0. Thus, 


Py(u, v) 


13 
P,(u, v) 7 


For each (u,v) € S(0,a4,a5,1) both P,(u,v) > 0 and P2(u,v) > 0. Therefore, 


Clearly, deg(Q) = 0 and maz-coef{Q) = 1. Since deg(B) < D, and maz-coef(B) < 2M, 
then by Lemma 2.3, af < D+1 and ay < 2M +1. 


Case 3: P,(x,y) is the integer L > 1. Recall that all coefficients of P,;(z, y) are integers, 
and P;(z,y) > 0, for (t,y) € S(0, a1, 2,1). Let P\(1,1) = cL + 7, where c is an integer 
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Otherwise, let B(x,y) = P(z,y) — Q(z,y) = Alea Since P # Q then sign(B) # 0, 
which implies that sign(B,) # 0 and sign(B2) 4 0. By Lemma 2.2, deg(B) < 2D, and 
maz-coef(B) < 2(D + 1)?M?. Let a, = max{ am, ™(Bi), ™(B2)}, and a, = max{ 
Q2, T2(B,), 72(B2)}. Lemma 2.3 gives the bounds aj < max{a;,2D + 1} and a < 
max{a,,2(D + 1)?M? + 1}. Since the comparison P(z,y) > Q(z,y) is equivalent to 


B(z,y) > 0, the lemma follows. 0 


The next step is showing the effect of rational operations on our parameters. This 


case is a rather simple case. 


Lemma 4.4 Let P(z,y) = nes and Q(z,y) = ule be two multivariate rational 
expressions, where deg(P), deg(Q) < D, and maz-coef{(P), maz-coef(Q) < M. Then, 


deg(RoS) < 2D, and maz-coef(Ro S) < 2(1+ D)*M?, where o € {+,—,*,/, }. 


Proof: A special case of Lemma 2.2. QO 


In the following lemma we show how to “handle” the floor operation, which is the most 
interesting case. As for one variable, the main objective is to find a rational expression 
that coincides with the value of the floor operation on some subset of the inputs. In 
order to get control over the degree and maximum coefficient of the rational expression 
we restrict the input to a subset of the previous inputs. The proof is similar in spirit to 
Lemma 3.4, and is also done cases. The cases represent different relationships between 
the lexicographic order of the two polynomials in the rational expression (to which the 
floor is applied). The main conceptual difference with the one variable proof technique 


is in Case 5 of the proof, where we introduce new variables. 


Lemma 4.5 Let P(z,y) = Aiea be a rational expression with integer coefficients, 


defined over S(0,a1,a2,a@3), such that P(z,y) > 0 for (z,y) € S(0,a1,a2,a3). Define 
max{4, a,, deg(P)} = D, and max{4, a2, a3, maz-coef(P)} = M. Either 
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The intuition is that the sequence wuo,...,U;41 includes all the variables that we 
introduce. Notice that equations 4.1 and 4.2 imply that uo > u, > ug > ug >... > Uy > 


u,r41. The following definition relates the first pair (uo, u:), and the last pair (uy, ur41). 


Definition 7 Let S(r,a,,a2,a3,4,A) denote the following set of ordered pairs of 


positive integers: 


{(uo,ui) : there exist integers uy, u,r41 such that (uo, v1) 


is <1, Q1, 02,03, A,A > —generated by (ur, Ur4i)}. 


For convenience we omit the null vectors A and A whenever r = 0. In this case, the set 
S(0, a1, @2,@3) consists of all pairs (uo, u,) such that up > (ui)%, ui > a2, Uo = uy = 1 


(mod a3) (in accordance with the above definition). 


Perhaps the most important characteristic of the above definitions is its similarity to 
the Euclidean algorithm for solving the gcd problem. As an immediate consequence of 
the definition, we get the two properties stated below. These properties are the key to 


our proof strategy. 


Lemma 4.1 (The Correspondence Property): There is an one-to-one corre- 
spondence between the elements of the sets S(r,a ,Q2,a3,A,A) and S(0,a1, a2, a3). 
Specifically, each pair (uo,ti) € S(r,a1,a2,a3,A,A) corresponds to the unique pair 
(Uy, Urgi) € S(O, a1, @2,a3) such that (u,,Uy4i) ts the <r, a4, 2,03, A, A >-generator of 


(uo, ui). Furthermore, if (uo, ui) corresponds to (u,, ur41) then gcd(uo, u1) = gcd(ur, Uy41). 


Proof: Let az > a3 >... > a4; and bz > b3 >... > 5,41, be the generating sequences 
for (ao,a1) and (bo, 51), respectively. It is easy to check that if (ai,ai41) # (6:, 0:41) for 
some 0 <2 <r, then (a;,a;41) 4 (0;,;41) for any 7, 0 <j <r. The assertion about the 


gcd’s follows from the Euclidean algorithm. O 
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We give here a very brief description of the differences between the technique for 
two and one variables. The main difference between the two proof techniques is in 
the handling of the floor operation. Consider the expression lz]. If we assume that 
xz >y > 0, then we can claim that the value of this floor operation is zero. On the other 
hand, consider [z|. Since we assume that z > y > 0, we have to find a new solution for 
such an expression. We overcome the problem by adding a new variable, z, and requiring 
that both y > z > 0, and y+ z = x, which implies that |2| = 1. In some sense we 
transformed a computation based on the inputs z and y to a computation based on the 
input y and the “new input” z. The problem is that we introduce a new dependency 
between zx and y, e.g. in our example z = y+2z < 2y. The general solution has a similar 


flavor: we substitute z by Ay° + z. The values of 6 and \ are chosen to guarantee that 


previous requirements on the relationship between z and y hold. | 


The introduction of a new variable is the main difference between the technique for 
two variables and for one variable. Note that we may have to introduce many new 
variables, one for each floor operation. This causes additional complications in the proof 


technique that we need to resolve. 


Using this proof technique we show a lower bound on the depth of any computation 
tree with OP = {+,—,*,/, |-|}, that computes the gcd of all pairs of n-bit integers. The 
important part of the lower bound is that it is non-constant; quantitatively, the lower 


bound is Q(log log 7). 


This chapter is organized as follows. In section 4.2 we describe the way that we 
structure the inputs that will be used in the proof. In section 4.3 we develop the proof 
technique. In section 4.4 we use the proof technique to show a lower bound for computing 


the greatest common divisor of two integers. 
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3.3 Applications 


We start this section by characterizing a class of problems for which our technique gives 
non-trivial lower bounds. In the previous section we showed, that given a computation 
tree, we can find a path and a set of inputs S(n, A), such that all those inputs follow the 
path. The following defines an A/(n)-invariant function to be one which has two inputs 


in S(n, A), for which the output is different. 


Definition 5 Let f(x) be a boolean function whose domain is the set of n-bit integers, 
and let M(n) > 0. Then f is M(n)-invariant if, for any integer 1 < M(n), there are 
two n-bit integers a, and az, satisfying (i) ay = a2 =0 mod JA, and (ii) f(a) # f(aa). 


The following theorem states that the lower bound technique that we developed in 


the previous section gives a lower bound for any M(n)-invariant function. 


Theorem 3.6 Any decision tree with OP = {+,—,*,/,|-|} and constants {0,1}, 


that computes an M(n)-invariant function, f(x), for all n-bit integers, must have depth 


Q(,/log log M(n)). 


Proof: Since f is a boolean function, we can assume that the leaves are labeled by 
either zero or one. (This adds at most two to the depth of the tree.) We prove the 
lower bound by contradiction. Suppose that we are given a decision tree T of depth 
he< loglog M(n). that decides f. In Theorem 3.5 we prove that there is a \ < 
2m < M(n), such that all the inputs in S(n,X) reach the same leaf. Since f is M(n)- 


invariant, there are a, and a2, such that a1,a2 € S(n,) and f(a.) # f(az). 


Both a; and az follow the same path in T. Therefore T either accepts or rejects both 


f(a,) and f(az). This contradicts the fact that T computes f. Oo 


Clearly, the larger the value of M(n), the better the lower bound that we can show. On 


the other hand, since M(n) < 2”, we can not show lower bounds greater than 0(./log 7). 
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This enables us to transform all the floor operations, on a specific path, to evaluations 
of rational expressions. Lemma 3.2 gives a way to restrict the input such that the 
comparison value is fixed. This will enable us to find an integer \ and a path P, such 
that any integer in S(n, 2) follows the path P. Lemmas 3.2, 3.3, 3.4 also bound how 
fast the degree and the size of the coefficients of the rational expressions can grow. The 
bounds on the degree and maximum coefficients enable us to bound the value of A, which 
is essential for showing the lower bound on complexity. For example, note that if A > 2", 


then S(n, A) = @, which would not enable us to derive any interesting lower bounds. 


Theorem 3.5 Let T be an decision tree of depth hh with OP = {+,-,*,/,|-|}. There 
is an integer X < ot” and a leaf v; of T, such that for each input a € S(n,A), the 


computation follows the path from the root of T to its leaf v,. 


Proof: Denote the vertices on the path P from the root of T to its leaf vy; by v1, v2,..-, v1, 
in that order, where v; is the root of the tree T and v; is a child of v;_1. We define the 
path P and the integer \ inductively, starting with the empty path and \ = 1. (S(n,1) 
consists of all n-bit integers.) As part of the induction hypothesis, we maintain three 
properties of the path and the set under consideration. These properties are described 


below. 


Suppose that (i) we have selected a prefix of P, which starts at v;, and ends at a 


vertex v;41, and (ii) defined a parameter A“) with the following properties: 


1. For each input a € S(n, A) the computation follows the path from the root to 


Vi4l15 


2. For each computation vertex vy on the path from the root to the vertex vj41, ex- 
cluding the vertex v;41, there is a pair of polynomials (F,(z),G,(z)) with integer 


coefficients, such that for each input a € S(n, A), G_(a) # 0, and val(v) = aor 
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Case 3: P2(x) ~ P,(r). Let L be the leading coefficient of P,(z). Note that since 
sign( P2) = +1, then L > 0. Corollary 2.7 implies that 


= A(x) P2(x) + R(z) 


where A(x) and R(z) are polynomials with integer coefficients such that, maz-coef{(A) < 


22 M+), maz-coef(R) < 22+1M?+?, deg( A) < D and deg(R) < D. 


Consider the constant term of A(z). We denote this constant by cL4+7, where c is an 
integer and ¥ is a non-negative integer such that 0 < y < L*. Let A(x) = A(x) +cL?+7; 
that is, A(z) is equal to the polynomial A(z) minus its constant term. The parameter 
x will be chosen to be a multiple of L?. This implies that for each a € S(n, Am), each 
monomial of L~¢A(a) evaluates to an integer. Hence, for each integer a € S(n, Ar), 


[Fray| Ze +e+ || 


We distinguish between two subcases: 


Subcase 1: y > 0. Consider the polynomial V(r) = L~4(yP,(x) + R(x)). The 
leading coefficient of V is L~4(yL), therefore sign(V) = +1. Let B(x) = P2(x) — 
V(z). The leading coefficient of B(z) is (1 — L~*y)L. Since 0 < L~4y < 1, the 
leading coefficient of B(z) is positive, i.e. sign(B) = +1. Using Corollary 2.4, let 
be the minimum multiple of L¢ such that t > max{m(B), x(yP2+R), (Pi), 7(P2)}. 
Consider a € S(n, Aw). Since r > 7(B), then V(a) < P2(a). Since r > r(yP2+R), 
then 0 < V(a), which implies that 0 < V(a) < P2(a). Since > m(P2), then P,(a) 
> 0. This implies that 0 < 44 = vee) <1, for any a > 7. We conclude 


P2(a) 


that for each a € S(n, An), | is = L~4A(a) +c. Let 


d 
= (P(e). for ne S(n, Ax). 


30 CHAPTER 3. ONE VARIABLE FUNCTIONS 


The following claim shows how we can restrict the input by changing the parameter of 
S(-). Namely, multiplying the parameter by another integer and considering only integers 


that are divisible by the product. 
Claim: 3.1 For any integer 7 > 1, S(n,Amr) C S(n,A). 


The following lemma enables us, each time the computation reaches a comparison 
vertex, to fix one of the directions. This involves restricting the inputs, from S(n, A) to 


a subset of it, S(n, Aw). An important part of the lemma is to bound the value of 7. 


Lemma 3.2 Let P(x) and Q(x) be two rational expression, of degree at most D and 
maximum coefficient at most M, defined over S(n,). There eae m<2(1+D)M? +1, 
such that for all x € S(n,A7), the comparison P(x) > Q(x) is determined (i.e. either 
P(x) = Q(z), P(x) > Q(z), or P(x) < Q(x), for x € S(n,Am)). 

Proof: Let B(z) = P(x) — Q(z) = Be. If P(x) = Q(z) for z € S(n, A), let tr = 1, and 
the lemma follows. Otherwise both sign(B,) # 0 and sign(B2) 4 0. By Lemma 2.2 the 
degree of B is at most 2D and maz-coef{B) < 2(.D + 1)M?. 


Corollary 2.4 guarantees the existence of a positive integer = max{7(B,), 7(B2)}, 
such that for all integers z > 7, either B(x) > 0, or B(x) < 0. Furthermore, it guarantees 
that  < 2(D + 1)M? +1. Since the expression P(r) > Q(z) is equivalent to B(x) > 0, 


the lemma follows. Oo 


Next, we consider the case of a computation vertex. First we discuss rational opera- 


tions, and restate Lemma 2.2, for univariate rational expressions. 


Lemma 3.3 Let P and Q be two rational expressions, of degree at most D and maz- 
imum coefficient at most M, defined over S(n,r). Let R(x) = P(r) 0 Q(z), where 
o€ {+,—,+*,/}. Then, deg(R) < 2D and maz-coef{R) < 2(D +1)M?. 
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Essentially, the degree argument uses the fact that a polynomial cannot change “too 
fast”. Namely, a polynomial of degree d can change its sign at most d+1 times. Therefore 
a polynomial of degree d can “disconnect” the inputs to at most d+1 different connected 
components. The number of connected components of the function to be computed 
implies a lower bound on the degree of the polynomial, which implies a lower bound on 


the depth. 


The degree argument can be extended to rational functions, and also in higher 
dimensions. The arguments for higher dimensions are much more complex than for 
one dimension, and they are the main part of the lower bound technique developed in 


[BO83, SY82, Yao89]. 


Adding the floor function as a basic operation changes the situation dramatically. 
First, using the floor function, one can trivially compute the parity function in O(1) 
steps. Second, the floor function may increase the number of connected components in 
an unbounded way. For this reason the number of connected components is not relevant 


in computations with the floor function. 


In the remainder of this section we give a general overview of the proof technique 
that we develop. Given a decision tree, the proof constructs a path from the root to a 
leaf, such that a certain set of inputs follows that path. The property of the set, is that 
it includes all the integers that are divisible by a certain integer \. A main part of the 
proof technique is to bound X as a function of the depth of the decision tree, i.e. to show 
that A < g(h), where h is the depth of the tree. In Section 3.3 we show applications 
of this proof technique. We define a boolean function to be M(n)-invariant if for any 
integer A < M(n), there are at least two n bit integers, both divisible by A, for which 
the value of the function differs. We argue that any M(n)-invariant boolean function 


requires depth at least h, where g(h) = M(n). 


As one can expect, the main part of the proof technique is devoted to handling the 


floor function. The main idea is to transform the floor function to a rational function. 
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For univariate polynomial the situation is even simpler, and is summarized in the 


following corollary of Lemma 2.5. 


Corollary 2.7 Let P(x) and Q(x) be two polynomials with integer coefficients, of 
degree at most D, and mazimum coefficient at most M. Let L be the leading coefficient 
of Q(x), then there exists A(x) and R(x) such that, 

1 1 
P(t) = ppp A(2)Q(z) + FeO), 
where A(x) and R(z) are polynomials with integer coefficients, and maz-coef( A) < 2? M+}, 


maz-coef(R) < 2?+!1M?+?, deg(A) < D, and deg(R) < D. 
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R(z,y) such that 


P(2,) = per Ala, vIQ(2.4) + Gaz Alt), 
where A(z,y) and R(z,y) are polynomials with integer coefficients. Furthermore, 
1, mar-coef(R) < (2 + deg,(Q))’*?MN**, 
2. deg,(R) < deg,(Q){= d}, and deg,(R) < deg,(P) + (6 + 1)deg,(Q). 
3. deg,(A) < max{0,6}, and deg,(A) < deg,(P), and 


4. maz-coef(A) < (2 + deg,(Q))>MN®. 


Proof: The proof is by induction on 6. The hypothesis holds for the basis case 6 = —1 
with A(z,y) =0 and A(z,y) = P(z,y). 


For the induction step, assume that the hypothesis holds for all 6 < k, for some 
k > —1. We prove it for k. Let P(z,y) = pi(y)z® + po(y)z* 1 4+..., be such that 
k = e —d. Consider the polynomial 


S(z,y) = LP(x,y) — t*pi(y)Q(z, y). 


One can verify that, maz-coef(S) < (2+ deg,(Q))MN, deg,(S) < deg,(P) — 1, and 
deg,(S) < deg(P) + deg,(Q). 


Applying the hypothesis to the pair S(z,y) and Q(z, y), yields 


Sey) = TA'(e,y) Qu) + Ray). 


Substituting for S(z,y), we get 


Play) = Taz (A (ey) + Lo'ni(y)) (2.9) + Gag R (9): 
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Bivariate and univariate polynomials 


A bivariate polynomial is a polynomial with two variables. Below, we relate the lexico- 
graphic order defined on the bivariate polynomials, and the order among their values at 
certain points. We concentrate on bivariate and univariate polynomials since those are 


the cases that we would be interested in later. 


Consider a bivariate polynomial P(z,y). We would like to give a simple sufficient 
condition on the inputs of P, such that for any input (x,y) that satisfies the condition, 
the sign of P(x, y) would be the same as sign(P). The sufficient condition that we give in 
the lemma below guarantees that the value of the leading monomial is larger than all other 
monomials combined. This is clearly sufficient to ensure that sign(P(a,6)) = sign(P), 


for such inputs (a, d). 


Lemma 2.3 For each bivariate polynomial P(z,y), such that sign(P) # 0, there 

exist positive integers ™(P) and 2(P) such that for all (a,b) satisfying a > 6") and 
b> m2(P), sign(P(a,6)) equals sign(P). Furthermore, m(P) < deg,(P)+1, and m2(P) < 
ma-cochPy +l where L is the leading coefficient of P. 
Proof: Let M = maz-coef(P). Let P(z,y) = wig Lyx'*y**, where P(z,y) is written in 
its normal form, and Lx is the coefficient of the kth monomial (Lo = L). Note that since 
sign(P) #0, then L a 0. Denote t,(z,y) = z*y*. Let 71(P) = 14+maxoce<m—1{0, Jeti — 
je}, and 72(P) = ““. Clearly, m(P) < deg,(P) +1. 


From the lexicographic order it follows that sey) = z'y?, where either? < 0 andj < 
m(P) ori =0 andj <0. Thus, if a > 67?) then tata (oo) < }, and hence, Cs < x. 
Suppose that L > 0, i.e. sign(P) = +1. We show that for (a,b), satisfying a > 67?) 


and 6 > m2(P), P(a,b) > 0, i.e. sign(P(a, b)) = +1. For (a,6), such that a > 67"), 


[Li.| 


P(a,b) > L to(a,b) — = [Zela'tb* > L to(a, b)(1 — ps Tok 
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Proof: We only prove the bound on maz-coef{(P *@Q). (The other bounds are straightfor- 
ward.) Note that a multivariate polynomial of degree d has at most (1 + d)* monomials. 
Therefore, when we multiply two multivariate polynomials, each of the coefficients of the 
product is the sum of at most (1 + min{deg(P), deg(Q)})* terms, each of them being 
the product of a coefficient in P by a coefficient in Q. Since the product of a coeffi- 
cient of P and a coefficient of @ is bounded by maz-coef{P) maz-coef(Q), the bound on 
maz-coef( P * Q) follows. Oo 


A rational number is a number that can be expressed as n/m, where n and m are 


gooey: 


P and Q are polynomials. 


P(zy 45.6, Lf 


al CT where P and Q are multivariate 


For a rational expression R(z1,...,2%) = 
polynomials, define the degree of R to be the larger of the degrees of P and Q. Similarly, 
define the marimum coefficient of R to be the larger of the maximum coefficients of P 


and Q. 


Lemma 2.2 Let R(2,...,0%) = PY and S(ay,...,t,%) = Ce be two 


multivariate rational expressions. Then, 


1. deg R+S) < deg(R) + deg(S), 
2. maz-coef(R +S) < 2(1 + min{ deg( R), deg(S)})* maz-coef{ R) maz-coef(S), 
3. deg R* S) < deg(R) + deg(S), and 


4. maz-coef(R * S) < (1 + min{ deg( R), deg(S)})* maz-coef{ R)maz-coef(S). 


Proof: The bounds follow from Lemma 2.1 and from the fact that for o € {+,-}, 


Ro S$ = BGP, 0 
2Q2 


18 CHAPTER 2. PRELIMINARIES 


The computation terminates at a leaf u and outputs the value of u. The time com- 
plezity of a given input is the length of the path that the computation traverses using 
that input. The time complezity of a computation tree is the maximum time complexity 
over all inputs, which is equal to the depth of the tree!, assuming that each vertex can 


be reached by some computation. 


As one can note, we did not mention how constants appear in the tree. We can 
introduce constants as additional operations, and view a constant as a constant function. 
In many cases it is simpler to state separately the basic operations and the allowed 
constants. However, the formal interpretation of such a statement is that there is one set 


of basic operations, that includes the constants and basic operations. 


A decision tree is a computation tree whose output values are either 0 or 1. For 
decision trees we can assume that each leaf is labeled by a constant, either 0 or 1. This 


can increase the depth of the tree by an additive factor of at most two. 


A straight line program is a computation tree that does not include any comparison 


vertices. 


The operations {+,—,*} are defined in the natural way. The rational division opera- 
tion returns a rational function of the inputs, e.g. 2/3 = 0.666---. (In case a division by 
zero occurs during a computation, the output of the entire computation is undefined.) 
The set of rational operations includes the operations {+,—,*,/}. We define the floor 
operation in the following way. The floor operation receives as an operand a non-negative 


real number and returns the largest integer smaller than or equal to the operand. 


Our assumption that the operand to the floor function is non-negative simplifies our 
proofs and does not change the depth of the computation tree by more than a constant 
factor. Using this floor operation, one can implement either a general floor operation 


(i.e. the operand is an arbitrary real number) or a mod operation using O(1) depth. 


‘The depth of a tree is the length of a longest path from the root to a leaf. 
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However, this is not the case in general. In most cases we would desire some con- 
ditional branching mechanism. The time complexity of a program that includes such a 
mechanism cannot be estimated by the number of instructions that appear in the pro- 


gram. This is best illustrated in the following example. Consider the following program: 


z:= 0 


WHILE y >1DO 


Zi=2z7+er 
y:r=y-l 
END WHILE 


For y € [1,n], the complexity of the above procedure is O(n) operations. The follow- 


ing program, that looks very similar requires only O(log n) operations. 


WHILE y > 1 DO 


zZi=2z2+¢2 


y= y/2 
END WHILE 


In order to avoid such problems, we “unroll” the loop structure. This means that the 
program is represented as a tree, possibly an infinite tree. The time complexity of a given 
input is the length of the path in the tree traversed by the input. We would consider 


only finite trees. 


After the above motivation, we can give a precise definition of the computation tree 


model. 
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Unlike much of the pr _..s research, our work concentrates on functions with a 
constant number of input variables. This allows us to get a better understanding of the 
complexities that are involved with the floor operation. Since there are only a constant 
number of inputs, we define the complexity measures as a function of the complexity of 
the input, e.g., the number of bits that represent an integer input. We develop a general 
technique for deriving lower bounds for such functions. In combination with the lower 
bounds, we show upper bounds that use the floor function in new and novel ways. In 
some cases we are able to derive matching lower and upper bound, up to a constant 


multiplicative factor. 


We develop a general technique to handle functions of a single n-bit integer input. 
Using this technique we derive 2(./log 7) lower bounds for problems such as computing 
[log log x}, deciding if the input is a perfect square (i.e. whether its square root is also an 
integer), and other problems. For computing |log log x], we give a O(./logn) algorithm, 
and thus show that this lower bound is tight. (Note that using only rational operations, 


computing [log log x| requires O(log n) operations. ) 


We extend this technique to handle two variable functions, and show that computing 
the greatest common divisor of two integers requires a non-constant number of operations. 
More precisely, we show an (log logn) lower bound for computing the greatest common 
divisor of two n-bit integers. This lower bound holds even if we restrict our attention to 


deciding if a pair of integers are relatively prime or not. 


We also develop a lower bound technique for approximating real valued functions. 
We consider approximating the s” root of a real number from the interval [1,2], on 
the “average”, where s is a fixed constant. We define an average approximation by 
considering the function which gives the absolute difference between the approximated 
value and the correct value, for each z € [1,2], and integrating this function over the 
interval [1,2]. The value of this integral is the “average error” of the approximation. 


th 


We show that in order to approximate the s™ root of an input from [1,2], such that 
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At the beginning of this decade, a general technique, based on topological arguments, 
was developed in [BO83, SY82]. The technique relates the logarithm of the number 
of connected components of a decision problem and the number of rational operations 
the problem requires. Based on this technique, many problems, whose inputs are either 
real or rational numbers, have been given tight bounds. This technique, under some 


restrictions, was extended to handle integral input by [Yao89]}. 


Another model that uses abstract arithmetic operations, rather than explicit bit op- 
erations, is the strongly polynomial model. The motivation for this model is to achieve 
polynomial time algorithms that would not depend on the representation of the input. 
This would allow inputs with infinite representation, as in the case of real numbers. The 
aim is that the number of operations would be polynomial in the number of inputs, and 
not depend on the size of the input. Grétschel, Lovasz, and Schrijver, in their book “Ge- 
ometric Algorithms and Combinatorial Optimization”, asked if there exists a strongly 
polynomial algorithm for the greatest common divisor of two integers. (See [GLS88], 
pp. 32-33, p. 225.) Notice that since there are only two integer inputs to the problem, 
any strongly polynomial algorithm for this problem must have a constant number of 


arithmetic operations. 


There are two “standard” sets of operations that are considered in strongly polynomial 
computations. The weaker model considers only rational operations, i.e. {+,—, *,/}. 
The more powerful model has an additional “rounding down” operation (i.e. floor). 
Stockmeyer [Sto76] proves an Q(n) lower bound for deciding if an n-bit integer is odd 
or even, in the weaker model, where only rational operations are allowed. We show in 
this work that there is no strongly polynomial algorithm for the greatest common divisor 
problem in the stronger model, where rounding operations are allowed. Our result implies 


that strongly polynomial time is different from polynomial time. 


When defining strongly polynomial computations one normally restricts the size of 


the integers involved in the computation to have size polynomial in the size of the original 


CONTENTS 


5 Approximation of Real Functions 


5.1 


5.2 


5.3 


5.4 


5.5 


Motivations: <5 is ein Fie 8 bh 0 ela ee Se eGo ark eee & 
DOR MIIONG ecges fest ae vals Se ae ee a Oe a Bh 
Overview... eee 
Lower bound technique |. oo. ee cc cece sec ce sacs. 


Lower bound for approximation .. 0.2... 000 cee ecw een 


6 Upper Bounds anitowbuy tad 


6.1 


6.2 


6.3 


Computing polynomials ...........0...00 0 0c eeeeaee 
eecteructiess 
Approximations... 2... 1 1. ee ee ee eee eae 


Fog’. 


. lobo sat! penezegmon 


‘Computing powers of two 2... 0... ee ee ees 


: Li a Poa aenteg ete 
potted LAGOS Daa 2b 


7 Conclusions 


enoitoauy’ad siduius? sa 


: ; rr oh - 6 
up In tosd Feorh Nis 


- ee 3 2 of 
. CHOU ENUIGER GS 


£J"323 sol booed rewad 


pee GHAS, 


aa 


Jigdsidie aleagds Nu 
gupludoss boo a) Ge 


huanuod vol wed £.B 


° 
Pi 
#, 


£ 


ee 


62 


73 


74 


87 


I would like to thank Mike Sipser for discussion that we had in the early stages of the 
research, discussion that helped me to define the problems in the early stages of the 
research. I would like to thank Larry Stockmeyer who served on my thesis committee. 


His numerous comments about this thesis have greatly to improve its presentation. 


I would like to thank the many friends I had in the theory group, especially Mark 
Newman, Noam Nisan, Phill Rogaway, Arie Rudich and Nir Shavit. I would like to 
thank all the theory group for the unique atmosphere that makes the research process a 


pleasure. 


I would like to thank Nina Wiener and ISEF for both their support and assistance 


during my graduate study, help which is well appreciated by me. 


I am grateful to IBM where I spent two summer, and for the financial support I 
received through the IBM graduate student fellowship. I would like to thank David 
Johnson and AT&T, for the summer I spent there. 


A special thanks to my parents Aviva and Jacob, for their continuous love, support 
and encouragement during all the many years of my studies. I would also like to thank 


Taly, Ido, and Yuval, for their love and encouragement. 


The person that I am most in debt to for helping me through my graduate years is 
my spouse Lea. Her love and encouragement have always helped me to bounce back to 
the right track. There are no words that can express how important her support has 


been for me. 


common divisor of two integers requires a non-constant number of operations; such 
a lower bound would separate polynomial time and strongly polynomial time. 


We resolve this open problem and show that there is no computation tree that 
computes the greatest common divisor in a fixed number of operations; in fact, 
we prove an (2(loglogn) lower bound for computing the greatest common divisor 
of two n-bit integers. Thus separating polynomial time and strongly polynomial 
time. 


3. Approximation problems. In this category we consider the complexity of computing 
an approximation to the s™ root of a real number. The notion of approximation 
that we consider is “on the average”, which is modeled by an £; norm. We show a 
Q(,/log log *) lower bound for approximating the s“ root of inputs in the interval 
[1,2], within € on the average. (The inputs to the computation, in this case, is the 


number for which the s“ root is approximated.) 


We also show interesting upper bounds. Using Newton’s method one can approx- 
imate the s root in O(log log +) rational operations, and this bound is tight (for 
rational operations). Using the floor function, one can accelerate the computation, 
and compute an approximation in 2(,/log log +) operations. (The input to the com- 


putation, in this case, are both the number for which the s“ root is approximated, 
and the error parameter «.) 
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